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Transmission Characteristics of a Three- 
Conductor Coaxial Transmission Line 
with Transpositions 


By G. RAISBECK and J. M. MANLEY 


(Manuscript received October 30, 1957) 


This paper describes in detail the manufacture of, measurements on, and 
computations about a three-conductor coaxial cable with transpositions. By 
reducing skin-effect losses in the central conductor, this three-conductor line 
achieves 17 per cent lower attenuation at a particular frequency than does a 
two-conductor line of the same outer diameter. The matrix method used for 
analysts can easily be extended to n-conductor lines. Suggestions for making 
improved three-conductor coaxial lines are made. 


I. SUMMARY 


Most of the loss in a simple coaxial cable is due to series resistance in 
the center conductor. At high frequencies this resistance is aggravated 
by the skin effect, which causes conduction currents to concentrate in a 
thin layer on the surface of the conductor. Because of the skin effect, the 
effective cross-sectional area of a conductor is smaller than its geometric 
cross section at high frequencies. 

The effective cross section of the center conductor of a new coaxial 
line has been increased by dividing the conductor into two parts, a solid 
central member and a concentric thin shell insulated from the center 
member. Over a certain range of frequencies, conduction currents are 
distributed throughout the whole thickness of the thin cylindrical shell 
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and a layer one skin-depth thick on the surface of the solid center mem- 
ber. 

The total current in the composite center conductor is divided into 
two components, one flowing in the solid center member and one flowing 
in the thin cylindrical shell. The effective resistance of the composite 
center conductor is less than that of a solid conductor only when these 
two current Components are approximately equal both in phase and in 
magnitude, and then only over a limited frequency range. The desired 
division of current has been achieved by electrical transposition of the 
two parts of the inner conductor at intervals of less than one-eighth of a 
wavelength. 

Several thousand feet of such a three-conductor transmission line have 
been constructed with transpositions at various intervals. Detailed meas- 
urements and calculations have been made, and the three-conductor 
cable has been compared directly with a similarly constructed conven- 
tional two-conductor cable. The three-conductor cable has lower at- 
tenuation than the two-conductor cable at frequencies between 1 and 
10 me with transpositions 9 feet apart or less. Near 4 me, the three- 
conductor cable has 17 per cent less attenuation than does the two-con- 
ductor cable. The agreement between measurements and computations 
is excellent. The improvement in the three-conductor cable can be in- 
creased to 25 per cent by changing the relative dimensions of the parts 
of the center conductor, and the frequency range over which it sur- 
passes a two-conductor cable can be increased thereby to 20:1. 


Il. BACKGROUND 


2.1 Introduction 


A transmission line with three parallel conductors can support two 


» mee ie . =. . 
modes of transmission at low frequency. Each mode has a well-defined 


phase velocity and attenuation at every frequency. It might be assumed 
that the transmission losses on such a line are minimized if power is 
transmitted only in the mode having the lower attenuation. If the line 
is sufficiently long, this is true; but for short sections of line it is not. The 
losses in a short section of line may be reduced, under some circum- 
stances, by a judicious combination of pure modes. 

The reason why the losses may be smaller for a combination of pure 
modes than for any of the modes separately depends on constructive 
and destructive interference. If two pure modes propagate with different 
wave velocities, the currents which they induce in the conductors will 
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be successively in phase, out of phase, in phase again, and so on. When 
the currents are in phase, the losses are greater than when the currents 
are out of phase. Over a long length of line, the phases will pass through 
many cycles, and the amount of power lost will average out to be the 
same as if the pure modes were propagated independently. 

If two modes are propagated over a short length of line so that there 
is substantial cancellation of currents, the losses will be reduced so long 
as the currents do cancel substantially. When the differential phase shift 
between modes begins to be appreciable, it will be necessary to terminate 
the line, draw off the power and launch it in a new length of line. If this 
is not done, the reduction in losses will be offset precisely by an increase in 
losses when the currents of the two modes become in phase. 

This phenomenon can be used to combat losses due to skin effect in a 
coaxial line. In a simple coaxial line, most of the effective resistance of 
the line is in the center conductor. This effective resistance can be re- 
duced if the center conductor is replaced by a smaller center conductor 
surrounded by a thin coaxial tubular conductor, provided the magnitudes 
and phases of the currents in these two inner conductors bear the right 
relation. As will be shown, this can be accomplished by transposing the 
two parts of the center conductor, which is equivalent to the process 
described in the previous paragraph. 

The problem of losses in a three-conductor transmission line has much 
in common with that of losses in a two-conductor line that has standing 
waves. The three-conductor line has four waves, one forward and one 
backward in each distinct mode of propagation, and these four waves 
may beat in various forms of destructive and constructive interference. 
In order to get an intuitive grasp of the performance of such a line, it is 
convenient to subdivide the problem into two parts: finding the loss in 
the line for a given distribution of currents, and determining how the 
current distribution which yields the lowest losses for a given power 
transfer can be attained. The answer to the first question tells how much 


a coaxial line can be improved; the answer to the second tells how to do 
it. 


2.2 Losses in a Three-Conductor Coaxial Having a Given Current Distribu- 
tion 
In the next section an accurate calculation of properties of a three- 
conductor coaxial will be made. But a good idea of how much the loss 
may be reduced in this kind of line, under the proper conditions, may be 
obtained by assuming a simple model for the line, as shown in Fig. 1, 
and also assuming that the currents in the three conductors are known. 
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Fig. 1 Three-conductor coaxial transmission line: cross section and sche 
matic longitudinal section showing surface resistance. 


The loss in such a line is given approximately by 
L = 43)1, Re" 
+ 3f\ Ti |’Raa” + (ils + Kls)Ra” + | Is Rw” | 
+4 [13 |’Raa’, 


where J; and /; are the current amplitudes in the inner and outer con- 
ductors respectively and the resistances are the surface resistances cal- 
culated from formulas (83) in Schelkunoff’s paper’ on cylindrical trans- 
mission lines.* The superscripts in parentheses refer to the particular 
conductor being considered; R,, is the resistance of the inner surface of 
a tube per unit length to current which is returned entirely inside the 
surface, Ry, is the resistance of an outer surface of a tube per unit length 
to current which is returned entirely outside the surface and R,» is the 
transfer resistance from one surface of a tube to the other. 

Of the five terms in the equation above, the first is the ohmic loss in 
the center conductor. The second, third and fourth terms are the loss in 
the intermediate conductor. The resistances Rag, Ra» and Ry» are real 
functions. 

It is obvious that only one of the five loss terms depends on the rela- 
tive phases of J; and J; . This is the third term, which contains the factor 


(Lil +- 1,13) 2/1, | | Is | cos 8, 


* The reader should note that the first-order correction terms in Schelkunoff’s 
formulas (82) are in error; formulas (75), from which they were derived, are cor 
rect. Briefly, Zaa , Zan, Zoa and Z», are the ratios of longitudinal electric field 
strength on the inner or the outer surface of a tubular conductor to the part of 
the total current returning either inside or outside the tube. The real parts are 
denoted by R with the same subscripts. For details, see Ref. 2, pp. 554-558. 
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where @ is the phase difference between /; and /; . Any reduction in losses 


depends on having /,» large and choosing /; and /; so that the term con- 


taining R,, is negative. 

These theoretical calculations show that the losses in a coaxial trans- 
mission line can be reduced by using a third conductor. Suppose, for 
example, that the skin depth, the thickness of the inner dielectric layer 
and the thickness of the intermediate conductor, are all small relative to 
the radius of the center conductor. Then the various loss formulas are 
simplified and it is easy to compare the losses in the two innermost con- 
ductors with the loss in a solid conductor of the same over-all dimen- 
sions. This comparison is made in Fig. 2. Here, the dotted and dashed 
curves show the resistances of the two conductors compared to the re- 
sistance of a single solid conductor. For the dotted curve, the distribution 
of currents is chosen to minimize the resistance. The required distribu- 
tion is shown in the solid curve, which gives the best ratio 7;/J,. The 
dashed curve shows the relative resistance if the current is divided 
equally in the two inner conductors. In either case, the resistance may 
be reduced to 65 per cent of that of a solid wire for a particular frequency. 

In a normal coaxial line 78 per cent of the effective resistance is in the 
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Fig. 2— Comparison of losses in simple and compound center conductors for 
certain current distributions. 
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center conductor and 22 per cent is in the outer conductor. When the 
center conductor resistance is reduced 35 per cent, the attenuation is 
reduced not 35 per cent but roughly 78 per cent X 35 per cent = 27 
per cent. 

From the curves of Fig. 2 we can conclude that, if the current in a 
suitably proportioned three-conductor coaxial line can be distributed so 


that the two innermost conductors carry equal currents, at one par- 


ticular frequency the attenuation can be reduced to 73 per cent of the 
attenuation of a two-conductor coaxial line of the same over-all dimen- 
sions and made of the same materials. The attenuation of the three- 
conductor line will be less than that of the two-conductor line over a 
20:1 range of frequencies, but at extremely low or extremely high fre- 
quencies it will have higher attenuation. 


2.3 How-.Can the Desired Current Distribution Be Achieved? 


The previous section shows that a coaxial transmission line having 
three conductors, the two innermost of which are very close together, 
will have lower losses than a conventional coaxial cable if the currents 
in the two innermost conductors are approximately equal. Two inde- 
pendent ways to achieve this equal division of current have been sug- 
gested. One way* is to alter the dimensions and dielectric constants so 
that the line has a natural mode of propagation with approximately the 
desired current distribution. The other proposal is to transpose conduc- 
tors in the manner of a litz wire to get the desired distribution by brute 
force, so to speak. This paper describes computations and experiments 
with a line having such transpositions. 


2.4 History 


The idea of transposition between a solid cylindrical conductor and a 
concentric tube is at least as old as U. S. Patent 1,088,902, issued on 
March 3, 1914, to P. V. Hunter. Fig. 4 of that patent shows a transposi- 
tion similar to Fig. 15 of the present text, and the exact wording of the 
specification states: 

“Tf it is desired to arrange the feeders so that the leads have equal im- 
pedance, the cable may be divided into an even number of equal lengths, 
the inner conductor a of one length being, as indicated in Fig. 4 {of the 
patent] connected to the other conductor b of the adjacent length. This ar- 
rangement also prevents alteration to the division of current between the two 
leads by unequal inductive effects on the two leads of currents in adjacent 
feeders or earth.” 


* Proposed by H. 8. Black and 8. P. Morgan, Jr. 
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Hunter was concerned with power distribution systems. The increase 
in resistance at high frequency resulting from the skin effect is not men- 
tioned in his patent. This is reasonable, for at a frequency of 50 cycles 
per second the increase in resistance of a copper conductor even as large 
as one-half inch in diameter is only 0.34 per cent. One can easily under- 
stand, therefore, why Hunter did not apply his technique to the redue- 
tion of skin resistance. 

Later, when the nonuniformity of current distribution at high fre- 


: . ‘ 
quencies became a matter of concern, stranded conductors were used 


rather than transposed coaxial cylinders. At frequencies below one mega- 
cycle per second, stranded conductors may be quite advantageous. 
Above that frequency, the fineness and large number of the strands re- 
quired makes the technique less rewarding. 

The work reported in this paper stems from a systematic search for 
ways to combat skin and proximity effects. The beginning of this search 
was stimulated largely by W. H. Doherty during his tenure as Director 
of Research—Electrical Communications at Bell Telephone Laborato- 


ries. 


2.5 Early Experimental Results 


The first three-conductor coaxial line with transpositions was made 
with a center conductor devised by W. McMahon and fabricated by 
him and G. R. Johnson as shown in Fig. 3. Starting with a bare wire 
30 mils in diameter, flat sections 8 in. long, 120 mils wide and 5 mils 
thick were formed at regular intervals along the wire so that they were 
separated by a length slightly greater than 8 in. This was done by placing 
the wire on a hardened steel plate and passing the plate and wire through 
a rolling mill. The flat sections were then formed into channels in a 
simple die. Then the round sections of the wire were insulated with a 
wrapping of plastic tape. Finally, two such conductors were placed side 
by side so that the flat sections (now channels) of one were matched 
with the round sections of the other, as shown in the figure. The round 
sections were laid in the channels, and the whole structure was passed 
through a die of diameter such that it caused the flat sections to be 
wrapped completely around the insulated round sections. Thus, the con- 
ductors alternate as inner and outer conductors along the line, as shown 
in the figure. 

Resistances measured at various frequencies were compared with 
computations based on the approximate formulas of Section 2.2, which 
were, at that time, the best available. The results are shown in Figs. 4, 
5 and 6. While the correspondence between the measured and computed 
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Fig. 3 Method suggested by W. McMahon for making a transposed com 


pound conductor 





0.0380 CM NNER CONDUCTOR 
ARO Chl OPPER 
0620 CN POLYETHYLENE 

17 CM NSULATION 

0.7200 CM—OUTER CONDUCTOR 

BRASS 

COMPUTED VALUES 

12-6-54 

12-10-54 > MEASURE 

12-30-54] 




















0.4 0.60.8 1 
LES PER SECOND 


Effective resistance of a l-meter length of coaxial line with a trans 


Fig. 4 
posed copper center conductor. 
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Fig. 5 Effective resistance of a l-meter length of coaxial line with a trans- 
posed copper center conductor of different size 














Fig. 6 Effective resistance of a l-meter length of coaxial line with a trans 
posed aluminum center conductor 
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values is not perfect, it is clear that the computed curves are a good 
qualitative approximation to the measured values. 

At that time, it was decided to make a long test sample for further 
measurement. The experimental length was chosen so that: (1) different 
transposition intervals could be studied, (2) attenuation could be meas- 
ured directly in a straight-through transmission measurement, (3) termi- 
nation effects would be small in comparison to transmission effects and 
(4) a control experiment on a two-conductor coaxial line could be easily 
carried out. As this cable was being fabricated and measured, a detailed 
analysis and computation procedure was worked out. Actually, the 
measurements were completed six months before the computations. The 
results of this last experiment and computation are the main subject of 
this paper. 


2.6 Plan of the Paper 


Following this introduction, we shall make a thorough analysis of the 
three-conductor transposed line. In Section III, the propagation prop- 
erties of a uniform three-conductor line are expressed in matrix form. In 


Section IV, a unit of transposed line, made by combining two equal 


lengths of this uniform line and a transposition, is described. A cascade 
of these units is then analyzed as a periodic transmission medium to find 
its new traveling waves and propagation constants. In general, the ex- 
pressions are complicated and numerical analysis is necessary to obtain 
useful results. However, if the spacing between transpositions becomes 
very small, quite tractable forms appear. Further, an approximation to 
these forms yields a very simple understandable picture of the behavior 
of these lines. Following this, Section V is a description of experimental 
work and final results. Details of calculation and tables of results are 
given in Section VI. 


lil. PROPAGATION IN UNIFORM THREE-CONDUCTOR COAXIAL LINES 


Consider the line with transpositions to be made up of a large number 
of equivalent sections each containing a certain length 2/ of transmission 
line and one transposition. For the sake of preserving symmetry, it is 
convenient to place the transposition in the center, flanked by two equal 
lengths J of line, as in Fig. 7. 

It is convenient to use matrix notation to describe the transformation 
of voltages and currents from one end of each of these elements to the 
other because the matrix of the transformation of any number of linear 
networks in cascade is simply the product of the matrices of the individ- 
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Fig. 7 — Schematic representation of a three-conductor transmission line with 
several transpositions 


ual transformations, even though the individual networks are not iden- 
tical (Ref. 6, p. 337). 

First, we will obtain a suitable matrix for the length / of uniform line, 
starting from the differential equations. Carson and Gilbert’ and Schel- 
kunoff have shown how to derive a first integral of Maxwell’s equations 
for a structure consisting of uniform concentric cylinders and shells. For 
the purpose of studying three-conductor coaxial lines it is only necessary 
to adapt Schelkunoff’s general results to the specific case at hand. 

Suppose the cross section of the transmission line is as shown in Fig. 1. 
Let the conductors be numbered, from the innermost out, 1, 2 and 3. 
Let the currents in the several conductors be ie’, ise?’ and ize?“ 
Note that small letters are used instead of the capital letters used before, 
and that the dependence on longitudinal distance x is included. Let the 
voltages of the various conductors be vye’”’, vre’** and v3e”*". To avoid 
ambiguity, we may suppose that voltages are measured radially inward 
from the outermost conductor. It follows that 7, -1; — lo and Ps 0), 
and we can eliminate 7; and v; when desirable. 

Following Schetkunoff, in analogy to his equation (52), we can find 


Jw) ay ’ 
: — E,(b,) 


log = E,(a2) 


27 by 


jwu(t, + 72) Qs 
: log 
2 by 


E.(a,;) — E.(be) — 
2r(g + jwe) 


l 
2a(g + jwe) 


log 
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Here a, and 6, are the inside and outside radii, respectively, of conduct- 
ing shell n, and £,(a,), £,(b,) are the longitudinal electric field strengths 
at the inner and outer surfaces of the nth conductor. 

To these equations must be added Schelkunoff’s equation (74) (modi- 
fied appropriately in notation) for determining the surface electric fields 


E.: 


E',(an) _ ) , a. + i I, c ; 
Eb.) = Zi Ie” + Za" hy”, 


(3) 


where 7,°"’ and J," are the parts of 7, which flow back inside and out- 
side the nth conductor and the Z’s are the artificial surface impedances,* 
the resistances of which were used above. We see from Fig. 1 that 


ra ° + I, ‘ = on ’ 


3 n 
n . . (n+1 
I, = _ > Un . > tn _ —I, ’ 
n+l 1 


I. = 0. 


The terms on the left of the first two equations (1) above are the in- 
ductive reactance voltages resulting from the magnetic field between 
conductors. To simplify the writing, let Xq; and Xq2 be these reactances 


so that 


WH a2 
— log 
2a b, 


Als , 
X a2. 


w 
. log ; 
b, 


or 


In the second pair of equations (2), voltages and currents are related 
by the radial admittances Y; and Y, of the dielectric between conductors. 
That is, 

2x(g + jwe) * 
= j “8 
log(d2/b;) 
2r(g + jwe) _ y 
log(as3/ be) 2 

* Formulae for the surface impedances for the cylindrical line are given in Sec 

tion VI. 
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Mquations (1) and (2) may now be written 


Ory 


Ov; P r r ° . (Pr , \ 
(Zoo + Zi = 2Z at) 11 — (Zoe = Zad)t2, 
or 
Ove fr r \- r : 
- (Zoo — Za)ti - L219, 
Ox 

where 


Zi = Zo ” ms Ba: , oa Xai (S) 


is the effective series impedance of the simple coaxial line consisting of 


conductors | and 2 only, and where 


Ze = Za" + Gas ‘ + jXa2 (Y) 


is the effective series impedance of the simple coaxial line consisting of 
conductors 2 and 3 only. The coefficients in these equations are all per 
unit length of line. 

In matrix notation (7) is 
Lat, 


Ox 


7 2,(x) 
Ole l 


Ox lo(x) 


Ov; v;(2) 


Ox 








Vo( x) 
= © — 


Ove 








| Of | 
This system has four eigenvalues, which are the roots of the charac- 
teristic equation of the matrix (Ref. 6, p. 314; Ref. 7, p. 111): 
determinant [A’ — y/] = 0. 
To each eigenvalue y;(j = 1, 2, 3, 4) corresponds an eigenvector, 


Pi(x) | 
I; t jo(a) 
V5] Vj (x) 








| vj2(x) 
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such that 














where the notations 


Vv) (x) 


V(x)] = 5 and Viz) = [or(x), ve(x) | 


v'o(2x) ead 


are used for column and row matrices respectively. 

The following convention regarding subscripts will be used in the rest 
of this paper: If a voltage or a current bears two subscripts, the first re- 
fers to the number of the eigenvector or mode, and the second to the 
number of the conductor. 


3.1 Traveling Waves and Eigenvectors 


A set of certain simple solutions of the system (10) can be expressed in 
the form 


u(x + A2, (x) e”® 14(x), 
Nio(2) e”* io(x), 

do, (2) e* v,(2), 

t) = dv(x) = e* v2(z). 


To a transmission engineer, each such solution is a traveling wave with 
a propagation constant y. To a mathematician, each such solution is an 
eigenvector with an eigenvalue \. More general solutions can be expressed 


as linear combinations of such elementary solutions. In this particular 


case, as we saw above, there are four traveling waves or eigenvectors, 
and the solutions are 


ti (2) du é vied + ine’ T gi € al + tue’, 
(x) = ie” + tee” + tee + lege’, 
V(x) =_ we". + Vo, e"?* +. ve" + ve", 
v(x) = vee’ + vee’ + vege’ + vee”. 


The eigenvalues 7; , corresponding to the 7 normal traveling waves of 
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the system described by A’, are isolated and shown clearly by factoring 
A’ into the canonical form (Ref. 6, p. 314; Ref. 7, p. 111): 

’ l ~ 
A'=LIrL , (15) 


where 








0x1 Coy U1 Vai 
Ih 2 Ts Vs) 
VilVel Va) V4) 





(18) 


the set of four equations (10) can be reduced by further differentiation 
to two separate sets of two equations: 
a7; avy 
Ox Ox 
Ore 
Ox 
Because )Z is the transpose of ZY, the eigenvalues of these two sys- 


tems are the same. Further, because the solutions have the forms (13) 
and must satisfy a relation like (12), we have 


YZ =v?) ° |. (20) 


jo vj2 


Since the eigenvalues vy: and y;° of (19) are the squares of those of (10), 


we have 
. 
(21) 
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They may be calculated from 
determinant [YZ — y'/] = 0, (22) 


as indicated above. 

The ratios of magnitudes of the several traveling waves in the two 
conductors may be calculated from (20) and a similar equation for the 
voltages after the y’s have been found. 

It may be noticed that if Z,, = 0, we have y = ¥1Zu or YoZo2 , cor- 
responding to two independent coaxial lines. This would be the case if 
the intermediate conductor tube were very thick or the frequency very 
high. 

The above results may be extended to any number of concentric tubes. 
For each added tube, two more equations are added to the original set 
of four, and one additional distinct natural mode which may travel for- 
ward or backward appears. The number of homogeneous equations [cor- 
responding to (20)] is one less than the number of conductors. 

The computation for the three-conductor case we considered was done 
by machine. The results will be described in Sections V and VI. 


IV. TRANSMISSION PROPERTIES OF LINES WITH TRANSPOSITIONS 


In order to combine sections of uniform line and transpositions as in- 
dicated at the beginning of Section III, we need the relation between 
currents and voltages into and out of a line section of length /. This is 


Pie + 2 | as (x) | 
ina + I to(x) 
u(x + 1) if v1 (2) 
| vex + 1) | | vel) | 














The matrix A is related to the matrix A’ discussed in Section III by 
A — LAL < (24) 
where* A = exp (TZ) 


5 . P . vil mm: . . 
is a diagonal matrix with elements A; = e”'. This throws into clear view 
the relation between the differential equations of the line and the trans- 


* The form (23) is derived from that of (10) by considering n sections of length 
Az in cascade, calculating i(zo + nAz) from [t(to + nAz) — i(2o)|/Az, and then 
keeping nAz = | while Az — 0. This is a case of Sylvester’s theorem. ‘A treatment 
is found in Ref. 7, p. 119, where polynomial functions are considered. A general 
proof for converging power series, including exponential series as a special case, 
follows by taking the limit as the sum of a polynomial series. 
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Fig. 8 — Schematic representation of a transposition in a three-conductor 
transmission line. 


mission properties of a finite length. The eigenvectors are the same, and 
the eigenvalues are related through the equation 
dj 


where A; = 1/Az and As; = 1/A,y correspond to (21). 
The transposition shown in Fig. 8, whose matrix we shall call 7, is 
governed by the equations 


where the primed and unprimed letters refer respectively to the two sets 
of terminals of the six-terminal network. In matrix notation, 


TO 








The line with transpositions is made up of a large number of equivalent 
sections, each containing a certain length 2/ of transmission line and one 
transposition at the center, as in Fig. 7. The matrix of the transforma- 


tion becomes 


ATA = LAL 'TLAL". 


Remember that A depends on the length of the section of line, although 
L. Land T do not. 





THE BELL, SYSTEM TECHNICAL JOURNAL, JULY 1958 


The characteristic equation becomes 
det (LAL “TLAL™ = vl| = 0. (28) 


The matrix can be modified without changing its determinant by multi- 
plying on the left by L~‘ and on the right by L, to get 
det [AL 'TLA — vl] = 0. (29) 
Inasmuch as the matrices Z and Y [see (7)] are both symmetrical, ZY 
is the transpose of YZ. Consequently, there is a set of orthogonality re- 
lations among the respective eigenvectors, as follows: 


I Vi] = die + LjVee a= {) 
753A aye Or Aj t1/A;j Ax + 1/A 
1Vil + Vil] = 0 


3 Ave or Ay Ar. 


The use of these relations simplifies the calculation of L>'TL con- 
siderably. The first step is to find L™': 


— _ 


Pu Pwr Pilu Prtie PaV pili | 


P21 PW Pola Polo P2oV2 pals 


P3¥31 Psv32 Pst31 Pst32 PaVs Pals 














| P44 Paae Patar Parae_] | pal 4 Dal a 


where 


l 


} 7 . 31 
Pi = oT VD ve? 


Using, in addition, the relations (21) and consequences, straightfor- 
ward but tedious multiplication yields for the characteristic equation 


(29) 
y* — v'Qa(ar + Ao’ — As — Ae) 
+ ¥[2Q" — (Q? + Q2) (AeA? + Ards’) 
— (QY — Qs’) (A2As + AL Ad)] 
— vQi( + re — As — se) +1 = 0, 
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where 
(VjV32 — Uns) / A, 


i\ Pr p3(un _ Vy. ) ee Pp 


ly Pi p30 aad Vy> ) eae 


A V31032 — V42V31 . 


Equation (31) defines p; and p;, and 
Q7 + Q. — Q; = 1. 

As anticipated, the equation has symmetrical coefficients, which means 
that the roots occur in symmetrical pairs, and also means that a substi- 
tution vy + 1l/»v £ reduces the degree to two. The roots v; of the charac- 
teristic equation (32) are the eigenvalues of the system and are related 
to the effective propagation constants yr; of a transposed line by »; 
e’T:', By this means, the propagation constants of the transposed line 
were derived from the eigenvectors and propagation constants of a sec- 
tion without transposition, with the aid of matrix properties of the line. 
The actual numerical computation was performed on a large electronic 
computer for several different transposition intervals, and the results 
are given both graphically and in tabular form in Sections V and VI. 


4.1 Infinitesimal Spacing of Transpositions 


A special case of the general results just given yields results with much 
less numerical calculation and throws interesting light on the whole sub- 
ject. From it, a very simple picture of the action of transposed lines is 
derived. 

Consider the situation when the uniform line sections on either side of 
a transposition have length dx instead of 1. The matrix A now may be 
formed from A’ by calculating 7;(2 + dx), ete., from di,/dzx. In this way 
we get 

Y, dx —Y, dz 
0 — Y; dx (Y; T Y2) dx 
(Zn + Zu — 2Z a») dx (Zoe = Za) dx l 0 


(Zoe ae Za) dx Zo dx 0 l 








L 


From this, the matrix A7'A for an element of transposed line is ob- 
tained easily. But, for present purposes, it is desirable to maintain the 
positions of 7,(x), z2(2), ete., in the column matrices of (23), and so we 
will consider a unit of length 4 dx having a matrix ATAATA. 





854 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1958 


Just as was found in Section III, it is easier to handle a system 
like (10) with matrix A’ when the line is divided into equivalent sec- 
tions of infinitesimal length. To do this, 4077,/dz is calculated from 
lin(x + 4dx) — im (x)|/dx, ete. The transposed line composed of these 
very short elements is described, as in (10), by 


al r| 
ax [7] 


where 
asf, 
2Y, + Y:2/2 
and 
Ze + Zu/2 — Za Zw — La 
Zo ae Zab Lo + Zi 2 ie Za 


Ly = 


The eigenvalues yr; of (33) occur in pairs and are calculated from de- 
terminant [Y7Z, — yr/] = 0. The eigenvectors J7;] and Vz, are eal- 
culated from 


as before. The results are 
(Low loss mode) 


Y: (Zo ee Zab 4 2 Zu 4), Yr2 = —-Yn, 


ie = Un Yo/2yn = Ue¥2/2yn, 
toe = Un V2/2yr2 = V2Ye2/2yr, 
(High loss mode) 
(Yi, + ¥2/4)Zu , Yn = —Yr3; 
—tse = va(4¥y + Vo)/2yrs = —vs2e(4Vi + Y2)/2yrs, 
—ty = Un(4¥1 + Y2)/2ym = —ve(4¥i + Yo2)/27n, 


where the 7 subscript on the voltage and current components has been 


dropped for simplicity. 
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It is clear in this case that one mode has the expected current distri- 
bution, 1.e., 2% iy , in Which equal currents flow in the two inner con- 
ductors. This confirms the heuristic analysis in an earlier section, where 
we assumed that such a current distribution could be achieved and con- 
jectured that it could be achieved by transposition at sufficiently short 
intervals. A detailed comparison with the estimate of loss L in Section 
2.2 shows that the predicted loss is the same as estimated before. How- 
ever, the present analysis is more revealing, because it not only tells the 
loss of the desired mode, but also shows the losses and current and 
voltage distributions of both modes. A knowledge of the current and 
voltage distributions is necessary to launch one mode without exciting 
the other. In this case, connecting the two inner conductors together, so 
that vy; = vv. at the launching point, is enough to guarantee that the un- 
wanted mode cannot be excited, for in the unwanted mode v3; — v32 does 
not equal zero. 


4.2 Simple Picture of Attenuation in the Transposed Line 


An interesting picture of the operation of the transposed three-con- 
ductor line can be obtained from (35) for yr when the transposition 
interval is very small, (ym being the propagation constant for the low- 
loss mode). In this expression, Y2 = jS2 may be taken as the radial ad- 
mittance and Za — Za + Zy/4 as the series impedance of an equivalent 
two-conductor coaxial. If the series impedance is R + 7X and R is very 
much less than XY, we have approximately 


x, ¥ 
> 4 SX. 
y ti VS: 


The attenuation (r.p. yr) is proportional to the resistance of the series 
impedance, and we will consider only this: 


R Roe sai Ra + Ry } 


) 3 ‘ > 9 ) a Z Rs, 
Rix sw Rw aes Ra + Vy 
} | 
Here, Raa is the resistance in ohms per meter of length of the outer con- 
ductor; Ry” and R,.” are the surface resistances along the outer and 
inner surfaces of the intermediate tube; A.» is the transfer resistance of 
. l 
this tube and Ry 
Because the intermediate tube is electrically thin for the frequencies 


is the surface resistance of the inner solid conductor. 


of interest, Ry» and R.» are quite flat as functions of frequency and 


very nearly cancel each other. Eventually, Ry,” begins to increase and 
Ra» to decrease with frequency. Until this frequency is reached, the re- 
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sistance of the central conductor of the equivalent two-conductor coaxial 
is given very closely by 


2 1) 
Ro: [Raa + Ry’). 


A physical picture of the hypothetical center conductor which agrees 


with this expression is shown in lig. 9 for a line n meters long. This has 
a real correspondence to the actual transposed line in that each of the 
two inner conductors consists of alternate sections of intermediate and 
inner conductors having the resistances per meter of the two terms above, 
and each carries approximately the same value of current. Of course, the 
transposition interval is not necessarily one meter, as indicated in the 
figure. Fig. 9 also shows a representation of the central solid conductor 
of the reference two-conductor coaxial. It is assumed here that the in- 
termediate tube and its inner insulation are very thin, so that their 
combined diameters are essentially the same as the diameter of the solid 
reference conductor. Its resistance per meter is then Reo : Ru" : 
Formulae for calculating the surface impedances are given in Section 
VI. The first terms of each of the approximations (38) are suitable for 


(1 


bk 


CENTER CONDUCTOR OF 2-CONDUCTOR COAXIAL LINE 
EQUIVALENT TO 3-CONDUCTOR TRANSPOSED LINE 


CENTER CONDUCTOR OF REFERENCE 2-CONDUCTOR COAXIAL 
TOTAL RESISTANCE = NRbb 


RESISTANCE PER METER = Robt 


Fig. 9 — Low-frequency representation of inner and intermediate conductors 
in three-conductor transposed line 
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present purposes and are reliable even at low frequencies for a 
walled tube. Thus 


’ 2 7 l 2 l 
Bas | = 5 ; 
27a» ot 2ragt 
l 
Ry» 
2b, 4 
using (37) also. If, in addition, be, a. and b; are approximately equal, 
| tg 
Ro ;, 
where 
l 
| mugf 


is the skin depth and ¢ is the thickness of the intermediate conductor. 
Then, the ratio of center conductor resistance in the three-conductor 


transposed line to that of the two conductor reference line is 


. R.: 1/6 
K = 22 ow (541), 
: R.» ( ) 


At the frequency for which skin depth equals intermediate conductor 
thickness, the resistance is cut in half. If 78 per cent of the cable loss 
occurs in the center conductor, a 39 per cent reduction of attenuation 
would result. This is quite a bit more than was found experimentally. The 
difference is that, even for 6/¢ 1, R.» does not cancel Ry, exactly. If 
the third terms in coth of and esch ot (the second terms givé reactance 
only) are added to the above simple approximations, we find 


, Rs 1/6 1/ty 
: l ; 
K R, 2 | ( T ) * 6 (‘) 


Now, when 6/¢ is 1, A equals 0.667 and a 33 per cent reduction of re- 
sistance or 25.8 per cent reduction of attenuation is obtained. This is 
very close to the value computed from the whole expression for one-mil 
polyethylene (Fig. 18). When the cubic term is appreciable, the simple 
picture of lig. 9 is no longer sufficient. 

We find K equals one for 6/f = 3 and 6/t = 0.66. The first is the 
lower-frequency crossover point (0.54 me) and the second is the higher cross- 
over point (11.1 me) where the three-conductor transposed line has the 
same loss as the reference two-conductor line. These values agree quite 





2 CONDUCTOR 
OUTSIDE UTSIDE 
DIAMETER H C E THICKNESS 
0.180” 0.0 OUTER .180” 0.004” 
7” NTERMEDIATE .049” 0.0012 
INNER 


A AA ”" 
0.04 


Fig. 10 Cross sections of two-conductor and three-conductor coaxial lines 
used in experiment. 


19 GAUGE 
CONDUCTOR 


POLYETHYLENE 


Fig. 11 Photomicrograph of cross section of the inner conductors of the 
three-conductor coaxial line. 


S5S 
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well with those calculated from the full expression for attenuation and 
plotted on Fig. 18. 
The above simple expression for center conductor resistance reduction 


thus gives a good description of three-conductor transposed line per- 


formance, provided the radii of the two conductors are nearly the same. 

If two thin shells surround an inner conductor to make a four-condue- 
tor transposed line, the simple picture of Fig. 9 may be generalized to 
give 


kK 


for low frequencies. At 6, ¢ 1, this would yield a 67 per cent resistance 
reduction or 52 per cent attenuation reduction. Naturally the cubie term 
is bigger than for the three-conductor line, and its effect would be felt 
at lower frequencies. 


V. EXPERIMENT AND COMPARISON WITH THEORY 


As mentioned above, it was predicted that there would be a reduction 
of attenuation in a coaxial line whose center conductor was laminated, 
with the laminations transposed periodically along the line. Confirmation 
of this prediction by resistance measurements of a three-foot length of 
such a line led to the decision to test the idea more fully by transmission 
measurements on a long section of line. 


5.1 Making the Cable 


This required the manufacture of about a mile of three-conductor co- 
axial cable of unusual design. A design and a method of fabrication were 
worked out in cooperation with members of the Western Electric Com- 
pany. Fig. 10 shows a cross section of the finished cable and its dimen- 
sions. The inner conductor is 19-gauge copper wire over which poly- 
ethylene was extruded to a thickness of 5 mils. Using a special die, the 
intermediate conductor was made by forming a copper tape about | mil 
thick and about 150 mils wide over the insulated wire, leaving a longi- 
tudinal seam. Ordinarily such a seam must be held closed with a helical 
wrapping of some kind, but this was found unnecessary when the second 
and comparatively thick layer of polyethylene dielectric was extruded 
over this intermediate conductor immediately after it had been formed 
by the die. One factor contributing to the success of this operation was 
the leaving of a small gap in the seam rather than an overlap. Micro- 


scopic examination of cross sections (see Fig. 11) showed the gap to be 
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about 5 mils or less. Capacitance measurements also indicated the pres- 
ence of a slight air gap a few tenths of a mil thick between the inner and 
intermediate conductors. The final operation consisted of forming the 
outer conductor, using a copper tape 4 mils thick by 0.600 in. wide and a 
special forming tool. In this case, there is a small overlap at the longi- 
tudinal seam, which is held closed by a helical wrapping of paper tape, 
5 mils by 0.750 in., applied 20 wraps per foot of cable. The effective air 
gap between the intermediate and outer conductors is about 1 mil, as 
determined by capacitance measurements. 

At the same time this cable was made, a two-conductor coaxial cable 
was also made for comparison purposes. The central conductor of this 
cable is a solid copper wire of 0.0470-in. diameter, which is just a little 
less than the outside diameter of the intermediate conductor of the other 
cable. For both cables, the polyethylene dielectric was extruded through 
the same die and the outer conductor was made from the same batch of 
copper and applied in the same way. A cross section with dimensions for 
this cable is shown in Fig. 10. Sections of both cables are shown in Fig. 
12. 

The use of a reference cable made in the way described makes it un- 
necessary for the testing of the attenuation reduction idea to depend on 

0.048" D 19 GAUGE 
COPPER WIRE COPPER WIRE 


A 
0.005" WALL POLYETHYLENE - 
0.001" x 0.146" COPPER TAPE 
0.1696" O.D. POLYETHYLENE 


— 0,004" x 0.600" COPPER TAPE —— 


<< 0,005" x 0.75" PAPER ———— > 


TWO CONDUCTOR THREE CONDUCTOR 


Fig. 12 — Photographs of two-conductor and three-conductor coaxial lines 
used in experiment. 
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a comparison of measurement with calculation. Rather, it depends on 
the comparison of the measured attenuations of two similar real cables, 
one having a solid central conductor, the other having a laminated cen- 
tral conductor in which the laminations are transposed regularly. 

Open and short-circuit impedance measurements showed that the 
characteristic impedance of the three-conductor cable is very near 50 
ohms. According to the estimates of Section II, the frequency at which 
maximum reduction of attenuation occurs is determined by the thick- 
ness of the intermediate conductor. For the cable described above, this 
is about 4 me. 

Because of the way in which they are made, the intermediate and 
outer conductors will buckle or even break if the cable is bent with too 
small a radius. To insure further against possible buckling or breaking, 
it was planned to handle the cable as little as possible. Hence, it was de- 
cided to re-reel the cable in a single layer onto suitable drums as the trans- 
positions were made, so that access to any part of the cable would be 
possible without additional unreeling and re-reeling. 


5.2 Making the First Transpositions 


After the cable was fabricated as described above, the major part of 
the laboratory work consisted of devising and making satisfactory trans- 
positions between the inner and intermediate conductors at regular in- 
tervals. At a point where a transposition was desired, the cable was cut 
and conductors and dielectric were stripped back so that all three con- 
ductors were laid bare at both sides of the cut. By means of special jacks 
and a plug, the center conductor on the left side of the cut was con- 
nected to the intermediate conductor on the right side of the cut and the 
intermediate conductor on the left side was connected to the center con- 
ductor on the right side. The two outer conductors were then connected 
together. Fig. 13 shows the cable ends connected to the jacks. 

Because of the fragile nature of the intermediate conductor and its 
supporting insulation, considerable care had to be exercised in making 
the transpositions. Ragged edges on the 1-mil copper had to be guarded 
against to avoid puncturing the 5-mil polyethylene. Damage to the insu- 
lation from excessive heat was avoided by using low-melting-point solder 
for making electrical connection to the intermediate conductor. 


5.3 First Measurements, Showing the Effect of Too Long a Transposition 
Interval 


At the two ends of the cable, the inner and intermediate conductors 
were connected together to serve as one input and output terminal. Fifty- 
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ohm terminations were used. The results of the transmission measure- 
ments are plotted on Fig. 14, along with those for the reference two- 
conductor coaxial. These were disappointing because, in the frequency 
region where a reduction of loss was expected, two large peaks of in- 
crease were found. Measurements made after changing all the patching 
plugs for straight-through connection gave results very nearly the same 
and very nearly as smooth as those obtained from a continuous piece of 
cable of the same length. Apparently the transpositions and not the dis- 
continuity at the section junctions were responsible for the peaks. With 
only one transposition at the center of the 1180-foot section, there were 
+0.2-db ripples in the loss curve. The effects of reflection at the uni- 
formly spaced transpositions accumulated, with the results already seen. 
Investigation showed the transposition interval to be one-quarter wave- 
length at the first peak and three-quarters of a wavelength at the second. 
Spacing the transpositions twice as far apart lowered the frequencies of 
the peaks by a factor of two. If there was indeed a reduction of attenu- 
ation between 1 and 10 me, it was covered up by the reflection effects. 

At this point, a thorough calculation of transmission loss was begun to 
help understand the phenomenon. Further experimental work was un- 
dertaken at the same time. Since no peaks appeared at frequencies below 
that corresponding to an interval of one-quarter wavelength, it seemed 
reasonable to expect the looked-for reduction of attenuation if the trans- 
position interval was shortened so that the frequency for which it was 


one-quarter wavelength was outside the desired frequency region. Using 


our previous experience as a guide, a spacing of about 9 feet was chosen. 
With this spacing, the first peak should have been at around 20 me. 


Fig. 13 — Technique for making cable connections at a plug-in transposition 
point. 
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Fig. 14 — Attenuation of line with 75-foot transposition spacing (approx.) com 
pared with that of reference two-conductor line and with computations. 


5.4 Final Measurement and Comparison with Theory 


The plug-in transposition in Fig. 13 was too elaborate to be used in 
such large numbers, and not too satisfactory. Therefore, a permanent 
transposition, shown in Fig. 15, was devised. A continuous cable having 
113 sound transpositions and 114 sections in a total length of 1057 feet 
was made ready for test. This time, after transmission measurements 
had been made and compared with those for the two-conductor reference 
line, the expected reduction of attenuation was found, as shown in Fig. 
16. The reduction at 4 me in the center of the region is 17 per cent. 

While the rough estimate of reduction made in Section II was 27 per 
cent, it should be pointed out that this depended on the assumption that 
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INTERMEDIATE LOW MELTING 
CONDUCTOR ) POINT SOLDER 


POLYETHYLENE ~ SOLDER MODIFIED 
LP-476870 
PAPER WRAP TERMINAL 


Fig. 15 — Cutaway illustration of permanent transposition 
the diameter of the solid inner conductor be substantially the same as 


that of the tubular intermediate conductor. In the actual cable, the di- 
ameter of the solid core is 28 per cent less than that of the surrounding 


tube, mainly because of the 5-mil polyethylene insulator. Therefore, a 
reduction of only 15 or 20 per cent should be expected. The actual re- 


duction is right in the middle of this range. 
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Fig. 16 — Attenuation of line with 9-foot transposition spacing (approx.) com- 
pared with that of the reference two-conductor line. 
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Fig. 17 Calculation of attenuation with transpositions at approx. 9 feet, 
approx. 44 feet and very small intervals; comparison with measurements where 
available. 

The calculation of transmission loss described in Section IV was fin- 
ished after the measurements. The results are plotted on Fig. 17 and 


show very good agreement with the measured data. 


The greater attenuation at very low frequencies is caused mainly by 


the transpositions and not by the fact that the central conductor cross 
section is 38 per cent less in the three-conductor cable than it is in the 
two-conductor one. The central conductor of the transposed line at de 
consists of two like structures in parallel. Each of these is a conductor of 
many series sections, with inner and intermediate sections alternating. 
Thus, the considerably higher-resistance intermediate sections dominate 
the situation. At very low frequencies, the attenuation ratio is approxi- 
mately 2, which is slightly greater than the value 1.6 shown at 100 ke. 


5.5 Transposition Interval 


In the calculation it was very easy to shorten the transposition inter- 
val. Two such cases are plotted in Fig. 17. When the spacing is just one- 
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half as long as before, the loss curve is nearly the same, but it crosses 
over the reference cable loss at about 10 me instead of at 8 me. Thus, 
the reflection peak at about 20 me (not shown on the figure) pulls up the 
loss curve a little at the high end. The other case calculated was for in- 
finitesimally small spacing of transpositions, and this gave substantially 
the same loss curve as the 44-foot spacing. We conclude that the reflec- 
tion peak at 40 me has a negligible effect on the loss curve in the region 
of improvement. Also, it is seen that the 17 per cent improvement ob- 
served experimentally would be raised to 18 or 19 per cent by spacing 
the transpositions at about 4-foot intervals. 


The attenuation was calculated also for the cable with transpositions 


75 ft 9 in. apart. The results, in good agreement with the measurements, 
are plotted on Fig. 14. While a random longer spacing of transpositions 
would greatly reduce the sharp peaks of loss in Fig. 14, the way to get 
the smoothest loss curve with the greatest reduction of loss is to make 
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Fig. 18 — Caleulated attenuation of a transposed line similar to that used in 
experiments, but with 1-mil dielectric thickness between innermost conductors. 
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the spacing small enough so that the first peak is well beyond the desired 
operating region. 


5.6 Effect of Spacing Between Inner and Intermediate Conductors 


Another result should be of interest here even though it was not ob- 
tained experimentally. A calculation for loss was made for a line in which 
the inner conductor was enlarged so that the polyethylene separating it 
from the intermediate conductor was only | mil thick instead of 5 mils. 
The results are plotted on Fig. 18. All curves are calculated so that they 
are on the same basis. At 4 me, the transposed three-conductor with 
5-mil polyethylene shows an improvement of 21.2 per cent, and the 
transposed three-conductor with 1-mil polyethylene shows an improve- 
ment of 26.9 per cent. 

The latter case approximates the original assumption of equal diame- 
ters much more closely than does the former, and shows an improvement 
very nearly the same as the 27 per cent prediction of Section IIL. It was 
realized in the beginning that the dielectric separating the inner and 
intermediate conductors should be as thin as possible, but 5 mils was a 
much more practical figure both for fabrication of the cable and for the 
transpositions. These results serve to point the way to further improve- 
ments. 


5.7 Conclusions 


The experiments have compared the transmission loss of two coaxial 
cables which are the same except that the center conductor of one is 
solid while that of the other consists of a shell surrounding and insulated 
from a center core. It has been shown that the transmission loss is less, 
over a certain frequency band, for the cable with the laminated center 
conductor, provided the laminations are transposed at sufficiently fre- 


quent intervals. Also, calculations of transmission loss show very good 


quantitative agreement with measured values, thus tieing the theory to 
the practice. Further, the results show that, if the insulating space be- 
tween intermediate and inner conductors can be reduced to a thickness 
negligible compared to its radius, the reduction of loss can be greater 
and will apply over a greater frequency band. 


VI. NUMERICAL CALCULATION 


6.1 Surface Impedances and Propagation Constants for Uniform Line 


The conductor surface impedances referred to in Sections II and II] 
may be calculated from formulae given in Schelkunoff’s paper.” 
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When the calculations were started, no attenuation reduction had been 
found experimentally, even though measurements had been made. 
Hence, we wanted to make the calculations as exact as possible in order 
that no small effect in the uniform line properties, which might have a 
large effect in the transposed line, would be overlooked. Therefore, the 
exact expressions for the surface impedances [equations (75) in Ref. 2] 
were used as far as possible. Expressed in terms of modified Bessel fune- 
tions of the first and second kind, /, and K,, , respectively, the results are: 


; n Io(ob;) 


Zon 


2rb, Ty ab; ‘ 


Pat n Io(aa)Ky(obs) Ko(ea2)1,(ab2) 
Ae ao : , 
2ras 1 (obo) Ky( eae) I (ca2)Ky(aobe) 


ids n Io(abo) Ky (eas) Ko(ob2) 1, (eae) 
Zu ; ; , 
2rb» I (ob2)Ky(oas) I \(ca2)Ky(abe) 


| l 
2rgasbs 1\(ab2)K (eax) — 1\(ea2)Ky(abe)’ 


ye 
- n Io(cas)Ky(obs) + Ko(cas)1\(ob;) 
Liu _ r - . 

27a; I \(ob.)K\(ca;) — [,(caz:)K,(ab,) 


In these expressions, the b’s and a’s are the radii of the various conductor 
surfaces as shown on Fig. 1, and » and o are the intrinsic impedance and 
the propagation constant of the conductor material. They are 


wh, 
n (1 + 2) 4 “Z 
a (1 + i) Vafug, 
t \ l. 
The dimensions of the cables as supplied by Western Electric are: 
0.0178 in. .52 X 10 * meter, 
0.0234 in. = 5.95 X 10* meter, 
0.0246 in. = 6.25 X 10‘ meter, 
0.0862 in. = 2.19 X 10°° meter, 
0.0902 in. = 2.29 x 10° meter. 
lor the two-conductor reference line, a; and b; are the same as above and 


b.’ = 0.0235 in. = 5.97 & 10° meter. 





CHARACTERISTICS OF THREE-CONDUCTOR COAXIAL LINI 


The cable material constants as supplied by the makers of the cable a1 


m 0 lr X 10 ‘ henry meter 


s) 


a xX 10 ° farad/meter 
obr 
g = 5.858 X 10’ mho/meters 


(at 68°F this is 101 per cent [ACS). 

These impedances were calculated by means of the IBM 650 computer 
from 100 ke to several megacycles at points uniformly spaced on a loga- 
rithmic frequency scale. Each frequency point is higher than the one 


0.1 . : ° 
. In the main program, the libraries used 


preceding it by a factor of 10 
for computing the modified Bessel functions at each point were prepared 
by Miss M. C. Gray. 

For sufficiently high frequencies, approximate formulas for the surface 


impedances are satisfactory. These are: 


| 
a (al P— PS i a 
" 2nd, [ t or | 


} Sigg - a coth a(be 
27a 


” 
coth a(bs 
2rbo | 4 


(38) 


“ac t) coth’ a (be -— a) | 
[ = - (2 - - 1) coth o( o\ — a) | 
n So 


2r Vv AobDe sinh a(be — fy) 


, | eoth o(b; — a3) — slat 4 
2ras 
‘ l i 


The higher-order correction terms are not the same as the ones given 


by Schelkunoff’s paper? which, as pointed out by Miss Gray, are incor- 
rect. These also were computed using the IBM 650 machine. However, 
they went through much faster than the exact ones, and so all the fre- 
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quency points (25) from 100 ke to 25 me were computed. As it turned 
out, the approximations were very close to the exact values of impedance 
except at the lowest few frequency points, and the greatest differences 
were in the reactances. 

The resistance and reactance of Z»'"’ (surface impedance of the inner 


conductor) are fairly close to one another, increasing about as /f and 


having values of 0.1 ohm near 1 me. The three resistances Raa”, Ry” 
and R., of the intermediate conductor are nearly the same and constant 
at about 0.15 ohm up to about 3 me, where the first two begin to increase 
and R,» begins to decrease. The corresponding reactances vary linearly 
with frequency over most of the range, the first two being positive and 
equal to the resistance at about 0.7 me, and X,, being negative and equal 
to Ra» at about 1.5 me. The resistance R,,” begins to increase from 0.015 
ohms around 0.4 me. All these values are per meter of length. 

In the third calculation using the IBM 650, the inductive reactances 
Xq, and X42 (5) of the space between conductors were computed and 
combined with the proper values of surface impedances at each of the 
25 frequency points (exact as far as they went and approximate beyond) 
to give the series impedances Z,, and Z», (8) and (9). The dielectric 
admittances Y; and ¥, (6) and the transfer impedance Z,, were then com- 
bined with these to form the coefficients of the characteristic equation 
(22). The four roots (two distinct) of this equation are the propagation 
constants of the uniform three-conductor coaxial line. Voltage and cur- 
rent ratios were computed at each mode, using (20). These quantities 
are shown in Table I. 

The numbers tabulated in Table I contain eight significant digits, just 
as they came from the IBM 650 computer. This precision would not be 
justified by the primary data if these were final results. But they are 
intermediate results which were used as input data to the computer for 
the transposed line calculation. 


6.2 Transposed Line 


From the characteristic equation (32) for the transposed line section 
of length 2/ having one transposition at its center, the four roots », 
were found and, from these, the four (two distinct) propagation con- 
stants yr; were calculated. This was done for the transposition interval 
21 = 73 ft 9 in. of the first experiment and the interval 27 = 9 ft 34 in. 
of the second experiment, and for an interval of 4 ft 7% in. not done ex- 
perimentally. Only the \; vary with transposition interval 2/, since the 
Q’s depend only on current and voltage ratios of the natural travelling 
waves. These quantities are shown in Table IT. 





TABLE | 


Frequency, m« 


0. 10000000 
0. 15848935 
0.25118873 
0.39810738 
0.63095778 
1 .0000008 
1 .5848947 
2.5118893 
3.9810768 
6. 3095828 
10.000016 
15.848960 
25.118912 


Frequency, m« 


0. 10000000 
0. 15848935 


0.25118873. 


0. 39810738 
0.63095778 
1 . 0000008 
1. 5848947 
2.5118893 
3.9810768 
6. 3095828 
10.000016 
15.848960 
25.118912 


Frequency, m« 


0. 10000000 
0. 15848935 
0.25118873 
0.39810738 
0.63095778 
1. 0000008 
1. 5848947 
2.5118893 
3.9810768 
6. 3095828 
10.000016 
15.848960 
25.118912 


Frequency, m« 


0. 10000000 
0. 15848935 
0.25118873 
0.39810738 
0.63095778 
1 .OO00008 
1. 5848947 
2.5118893 
3.9810768 
6. 3095828 
10.000016 
15.848960 
25.118912 


Unrrorm THREE-Conpuctor LINE 


10°a 


nepers/meter 


468883 1 
0799877 
.8395041 
. 7671483 
9660247 
5580372 
693456 
162726 
SS7865 
. 135279 
>. 526402 
. 384128 
42.047592 


w 


OND BND beet bet es 
SPE OWS DMO Oe 


a 
w 


10°a@ 


nepers, meter 


7318826 
6942080 

». 7525699 
8427964 
9337375 
044322 
241957 
643512 

.433451 

>. 928777 

664341 

>. 361051 
329361 


r.p 


. 18362010 
22474330 
. 27695130 
33751280 
.4088 1000 
48911890 
.56956110 
.62801670 
60888470 
.37242140 
41964610 
2.5566098 
7731939 


2/in (rp 


1816245 
1729224 
1624073 
1501389 
1354390 
. 1181120 
0980730 
0754678 
.0512315 
.0272895 
0063910 
.99185120 
.98612750 


871 


71 


a, T WB, 


1078 


radians/meter 


0228904 
1S76156 
0275308 


7 .9349729 
2.532983 


809733 
329134 
567992 
151609 


10°B 


llans meter 


62320343 


.84536090 


1723877 
6701939 
4480157 
6761608 


.6192649 
.6900187 


536061 
174433 


3.195442 


069905 
679447 


n/t Lp. 


16630834 


. 22944720 


30501473 
40492969 


.54361980 


73916345 


.0221795 


1417420 


2.0726476 


3.0234415 


4235045 


3. 2815904 


8207114 


»/t i.p 


025519684 


.0323958 13 
.038792862 
.045454252 


052262004 


.058627941 
.063870189 
.066832765 


065599039 
060118625 
048717882 
032878953 


.016017054 





Frequency, m¢ 


0. 10000000 
0. 15848935 
0.25118873 
0.39810738 
0.63095778 
1 .0000008 
15848947 
2.5118893 
3.9810768 
6. 3095828 
10.000016 
15.848960 
25.118912 


Frequency, m«¢ 


0. 10000000 
0. 15848935 
0. 25118873 
0.39810738 
0.63095778 
1 .QOOO0OS 
1. 5848947 
2.5118893 
3.9810768 
6. 3095828 
10.000016 
15.848960 
25.118912 


Frequency, m«¢ 


0. 10000000 
0. 15848935 
0.25118873 
0.39810738 
0.63095778 
1 .0000008 
1 .5848947 
2.5118893 
3.9810768 
6 . 3095828 
10.000016 
15.848960 
25.118912 


Frequency, m« 


0. 10000000 
0. 15848935 
0.25118873 
0.39810738 
0.63095778 
1 .0000008 
1. 5848947 
2.5118893 
3.9810768 
63095828 
10.000016 
15.848960 


5. 
25.118912 


TABLE | 


Concluded 


r.p.) 


84589800 
85192153 
85932664 
86810431 
87885470 
89191254 
90761553 
92625098 


.947530389 


97011293 
99132599 
0071079 
0137993 


(r.p.) 


9917880 
1786147 
6316601 
2145871 
88363188 


.6 2260636 


41596602 


. 25394751 
. 13047707 
.040132660 


021255900 
055588468 
063935269 


r.p.), ohms 


329377 


.639675 
. 548062 
3.030268 
}. 205314 
.947652 


776202 


.541720 
.405167 
. 247510 
2.817517 
.135311 


13 
5 
6 


6. 


6. 


OSS65 


(r.p.), ohms 


.7971713 


9324594 
0322898 
1941820 
5156971 


7 .0303966 
—7 .7530091 
8 .6694443 
.7168424 


780327 


699977 


286170 
411509 


1 (i.p.) 


018268992 
023529840 
.026878194 
.034308091 
040451987 
.046767278 
.052792147 
.057560082 
.059488464 
.056772712 
047989710 
033385241 
016466337 


(i.p.) 


2.7097173 


2242917 
7969975 
1571936 
.1750175 
94088943 
. 74652410 
58298677 
.44414391 
. 32580464 
. 22404223 
13656873 
064322522 


(i.p.), ohms 


. 1101280 


>. 2732360 


.518517 
.66609 1 


5.523024 
).084481 


.973284 


2.748214 


.27411 


4.12323 
) 


.93510 
. 23818 
.61598 


n (i.p.), ohms 


. 80779950 
. 10551400 
.46269910 
.89163540 
.3739212 
. 8609257 


2.2944494 


. 5937969 


2.6696293 
2.4534229 


. 9224920 
. 1462748 
. 383026606 





CHARA¢ 


TABLE 


Frequency, m« 


0.1000 
0.1585 
). 2512 
0.3981 
0.6310 
1.0000 
1.585 
1.995 
2.512 
3.162 
3.981 
5.012 
6.310 
7.9438 
10.00 
12.60 
15.85 


Frequency, m¢ 


1.000 
1.585 
2.512 
3.981 
6.310 
10.00 
12.60 
15.85 


Frequency, m«¢ 


0.1000 
0.1585 
).2512 
0.3981 
0.6310 
1.000 
1.585 
2.512 
3.981 
6.310 
10.00 
12.60 


TERISTICS 


OF THREE-CONDUCTOR COAXIAL LINI 


LINE 


THREE-CONDUCTOR COAXIAL 
IB yi 


TRANSPOSED 


YTi 


ari T 


Transposition Interval 2/ = 73 ft 9 in 
10°a 72 
nepers 
1.204 0.07565 
] L187 
l 1871 

l 9955 
l 1679 
l 0.7471 
$.O1S 1.209 
15 56 | 1.475 
y 

$ 

35 

#3 

} 


257 


oi 0 
320 0 
399 0 
519 0 


776 


07 1.415 

0.9181 
0.3169 
0.4368 
1.367 

0.6606 
0.8457 
).51038 
1.0378 


l 


Transposition Interval 21 ft 731 
1Wayi2l Bri2 


nepers radian 


0.04610 
0.07292 
0.1153 
0.1825 
0.2890 
0.4579 
0.5767 
0.7273 
Transposition Interval 2/ = 9 ft 34 
108a72/ 
nepers 


Bri2 
radians 


| Attenuation, 
db/1057 ft 


0.009458 
0.01486 
0.02337 
0.03690 
0.05830 
0.09221 
0.1459 
2.990 0. 2308 
3.804 0.3655 
5.301 0.5798 
9.102 0.9258 
16.45 1.185 


19] 
558 
638 
739 
S82 
103 
$40 
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TABLE I] Concluded 


Frequency, m« po Ll 
. 1000 1.2450 0.01740 
. 1585 1.4960 0.02391 
. 2512 .776 0.03354 
.3981 072 0.04826 
.6310 0.07121 
1.000 0.1073 
585 0.1640 
2.512 0.2535 
3.981 0.3947 
».310 0.6180 
.00 0.9763 
60 1.243 


SOS Se OWN N 


yr = @& + JB 
Transposition Interval Infinitesimal 


5-mil Dielectric 


108°: rn 
nepers/meter radian/ meter 


Frequency, m¢ 


1000 5 4! 0.003347 
. 1585 0. 556! 0.005521 
. 2512 0.5856 0.008271 
3981 0.6215 0.01306 
.6310 0.6725 0.02063 
1.000 0.7504 0.03263 
1.585 0.8716 0.05161 
2.512 1.053 0.08163 
3.981 1.326 0.1291 
».310 1.780 0.2043 
2.586 0.3233 
3.944 0.51138 
4.842 0.6430 


1-mil Dielectric 


Frequency, m« a+. -. ao 
. 1000 0.5109 0.003331 
. 1585 0.5330 0.005232 
. 2512 0.5565 0.008248 
). 3981 0. 5867 0.01303 
.6310 0.6308 0.02060 
.000 0.6999 0.03258 
585 0.8098 0.05155 
2.512 0.9770 0.08156 
3.981 . 232 0.1290 
>.310 1.663 0.2042 
.00 2.439 0.3232 
5.85 3.762 0.5112 
95 +. 368 0.6428 


1958 


Total Attenuation, 


db/1057 ft 


Total Attenuation, 


db/1057 ft 
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TABLE III Two-Conpuctor Ling — y a + j8 


Frequency, m« 


0. 1000 
0.2000 
0.4000 
0.7000 
1.000 


103 
nepers/meter 


0.3350 
0.4252 
0.5547 

.7055 


8344 


B 


radian/meter 


0 
Q. 
0. 
0 
0. 


003342 
006549 
01292 
02242 
03188 


Total Attenuation, 
db/1057 ft. 


0.9373 
1.190 
1. 55: 
a 


( 


2.000 186 0.06332 
4.000 .687 0.1259 

7.000 2.227 0.2197 5. 232 
10.00 2.659 0.3133 141 
14.00 3.145 0.4380 8.799 
20.00 3.756 0.6250 10.51 
25.00 198 0.7808 11.75 


l 


9 


5 
7 
3 

32 
| 
3 
{ 


In the case of infinitesimal spacing of transpositions, the propagation 
constant yr for the low-loss mode was computed from (35) for both the 
actual cable and a hypothetical one in which the intermediate conductor 
was separated from the inner conductor by only 1 mil instead of 5 mils. 
These quantities appear at the end of Table II. 

In order to have a calculated curve to compare these two results with, 
the propagation constant of the two-conductor reference cable was also 
computed by modifying the process used for the uniform line. These 
quantities are shown in Table III. 

The numbers delivered by the computer were rounded off to four sig- 
nificant digits in Tables II and III. In the table of results for the 9 ft 
3; in. interval, the values for infinitesimal spacing were used up to | 
me because the eight-digit precision of the computer was not sufficient 
to calculate the roots of (32) at these frequencies. At 1 me the two meth- 
ods give answers differing by about 0.1 per cent. 
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Synthesis of Series-Parallel Network 
Switching Functions 


By WARREN SEMON* 


Manuscript received February 13, 1958 


From the switchi ng functions of n variables, those which corre spond lo 
networks are abstracted and called network functions. Prope ries of those 
network functions corresponding to series-parallel networks are studied 


and a method for synthesis is developed. 


I. INTRODUCTION 


It is known that one may establish a mathematical model in which 
Boolean polynomials explicitly represent two terminal single-impulse 
series-parallel contact networks. The conventions used in this paper are 
Boolean plus, symbolized by @, to represent parallel connection; Boolean 
times, symbolized by - or by juxtaposition, to represent series connec- 
tion; zero, to represent an open circuit; and one, to represent a closed 
circuit. The symbol ’ is used to represent Boolean negation, also called 
“inversion” or ‘‘complementation.”” Whenever f represents an open 
circuit, f’ represents a closed circuit and conversely. This is a conductance 
analogue, the dual of that used by Shannon.’ If f(a, a2, <<: , 2») isa 
switching function it may be represented in tabular form’ asa “‘canonical- 
form matrix”? whose rows consist of those configurations of the variables 
for which f = 1, as shown below for the function 


¥ ; ; — , ; 
E Ly (Ve &® X3 )= 2 Le Xz ‘S) X'Xo T3 + X;' Lot! . 


* The Computation Laboratory of Harvard University, Cambridge, Mass 


877 





878 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1958 


Definition 1 


A “series-parallel” network is a series or parallel combination of two series- 
parallel networks; a single element is a series-parallel network.” 


Definition 2 


A “network” function is a switching function of n variables, having no 
vacuous variables, which can be realized by a network containing n switches. 

As an example: f(a , %2, %3 , %4) = xe’ ® 23/74 is a network function. 
A four-switch realization is shown in Fig. 1. 


Definition 3 


A “series-parallel network function” is a network function at least one of 
whose n-switch realizations is a series-parallel network. 

The network of Fig. 1 is series-parallel, hence the corresponding fune- 
tion is a series-parallel network (SPN) function. 

In what follows we will be concerned with the characterization of 
SPN functions and the synthesis of the networks to which they cor- 
respond. It should be noted that, although series-parallel networks com- 
prise a small subclass of the class of all relay contact networks,’ there are 
other types of switching elements for which they represent the only 
electrically useful networks. 


II. GENERAL THEORY 


The orem 1 


The inverse of any SPN function is also an SPN function. 


Proof: A Boolean expression may be derived which corresponds explicitly 














Fig. 1 Four-switch realization. 
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to the series-parallel realization of the function.’ DeMorgan’s law, when 
applied to this expression, yields a Boolean expression which exactly 
corresponds to a series-parallel n-switch realization of the inverse func- 
tion. 

Theorem 1 may be extended to cover all network functions whose 
n-switch realizations are planar.” 


The orem 2 


A minimum generating set* of network functions of n variables may be put 
into a one-to-one correspondence with the distinct? networks of n elements. 
A proof of this theorem has been given by Ashenhurst.° Some implications 
should be noted explicitly: (a) the n-switch realization of a network func- 
tion is unique, and (b) only network functions correspond to networks. 


Theorem 3 


Suppose that {(x; , X2, --*, Xn) isan SPN function. Thenf can be expressed 
asan SPN function of the variables o, , a2, +++, a, where each a is itself 
an SPN function of the variables in one of k nonoverlapping subsets of the 
variables x; , Xo, *** ,Xn. 

Proof: This is obvious from the definitions of SPN function and series- 
parallel network. 


Theorem 4 


Let (x1, X2, «+: , Xn) be an SPN function which in canonical form has 
m terms. Let f be expressed as an SPN function F(a, a2, +++, ax), 
k < n, where each a; is itself an SPN function of 8; variables with m; terms 


in its canonical form (as a function of its S; variables only). In the canonical 


form of F(ai), replace each a; by mi, each a’ by mi) = 2°' — mj, each 
Boolean ® by algebraic + and each Boolean - by algebraic -; then the 
result is m. 

Proof: It is only necessary to prove the theorem for two variables, since 
Definition 1 and Theorem 3 may then be used to complete an inductive 
proof. Suppose f equals aa2.f Each term of the canonical form of f may 
be formed by selecting one of the m, terms of a; and one of the m. terms 


* The group of variable transformations (permutations and inversions of the 
variables) divides any class of switching functions into N nonoverlapping subsets, 
called equivalence classes; a set of N functions, one from each equivalence class, 
is called a ‘‘minimum generating set.” 

+ Two networks are ‘‘distinct’’ if one is not derivable from the other by trans 
formations consisting of the reversal of two-terminal subnetworks. 

t Such a function is said to be “‘essentially series.”’ 
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of a2. All such terms must appear, and no others can. Hence m equals 
mM . 

Now suppose f equalsa; ® ae .* Foreach of the m, terms of a; there are 
2° configurations of the variables of a» which appear in the canonical 
form of f; similarly, 2”! configurations of the variables of a; are associated 
with each of the ms terms of a). This gives 2°'m. + 2°*m, terms. Of 
these, myme are duplicates. Hence 

S| 


m 2° my + 2°?m, — mm, 


myn + mms + myme. 


Theorem 6 


If (x, +++ ,X,) ts any SPN function, and m is the number of terms in its 
canonical form, then m is odd. 
Proof: By induction on n. The theorem may be verified by inspection 
for the cases n l and n = 2. 


Now assume the theorem true for all / < n. Consider an arbitrary 


SPN function of n variables. It can be expressed either as a;ao or as 
a, ® a», where aq; and a» are SPN functions of fewer than n variables. 
Again let m, be the number of terms of a; and m,. the number of terms of 
ay. Then 


my * Mo 
is odd, or 
Se 98 
m = 2°?m, + 2°'me. — myme 


is odd, since m, and m, are odd by the inductive hypothesis. This proves 
the theorem. 


Corollary 


It follows immediately that any network function for which m is even has 
only a bridge circuit realization. 
As an example, considerthefunction f(a, , 2 , 23 , 4, 25), Whose canonical- 
form matrix is given in Fig. 2 (m = 16). It has the five-switch realization 
shown, but no sertes-parallel five-switch realization.t 

It is reasonable to ask whether an odd number of terms in the canon- 
ical form of a network function implies that its realization is series- 
parallel. This is not so, as may be seen from the following example. 

* Such a function is said to be ‘essentially parallel.’’ 


t Note that Theorem 2 asserts that there is no other five-switch realization of 
this function. 





SYNTHESIS OF SWITCHING FUNCTIONS 88] 


The function F(a} ,22,2°3,2'4,2'5,2,27), Whose canonical-form matrix and 
seven-switch realization are shown in Fig. 3, has m 59 but the net- 


work is not series-parallel. 


Theorem 6' 
The function {(x, , X2, +++ , Xn) defined by fj i 3 
and f; 0 for 1=> misan SPN funetion for m odd. 





r. 
_ 


Fig. 2 Five-switch realization 


Definition 4 


The SPN function defined by f; l,i 0,1, ---,m — 1 andf; 0 
for i 2 m, m odd, is called a ‘‘canonical-form network function.” 
All SPN functions of one, two and three variables are, to within a vari- 
able transformation, canonical-form network functions. The first SPN 
functions which are not canonical-form network functions appear in 
four variables. Such functions may be characterized by the following 
discussion. 
Consider the Boolean expression for an SPN function which explicitly 
represents’ the n-switch series-parallel network for the function. Then 
i. There may exist a variable x such that f rg, with g an SPN 
function. 


* Here, f; is the value of the function for the configuration of the variables 
which would represent the integer 7 in the dyadic number system 
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ii. There may exist a variable x such that f = x2 ® g, with g an SPN 

function. 

iii. The function may be such that no x exists which satisfies either 

i or ii. 
In cases i and ii the function is said to be “reducible’’; in case iii, “‘ir- 
reducible.” 

If a function is reducible, then the “residue” function g may be ex- 
amined for reducibility. For any SPN function such reduction will ulti- 
mately lead to a residue function which is (a) a single variable, or (b) 
irreducible and a function of more than one variable. An SPN function 
whose ultimate residue function is a single variable is said to be ‘“‘com- 
pletely reducible.” 


The orem 7 


An SPN function is, to within a variable transformation, a canonical-form 
network function if and only if it is completely reducible. 
Proof: By induction on n, the number of variables. For n = 1 and n 
the theorem is obviously true. 
Now assume the theorem true for all k < n. Suppose f to be an SPN 


= 


A. 
% 3 


Seven-switch realization. 
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function of n variables which is completely reducible. Then either f = 2x’g 
or f = 2’ @ g, and g is a completely reducible SPN function of n | 


variables. The inductive hypothesis then guarantees that fis a canonical- 


form network function. 

Suppose now that f is a canonical-form network function of m terms. 

(a) If m < 2”", then f equals x,’g(a1, -** , 2n—1), Where g is a canon- 

ical-form network function. 

(b) If m > 2”, then f equals 2,’ ® g(a, *** , Yn-1), Where g is a 

canonical-form network function. 
In both cases, the inductive hypothesis guarantees that g is completely 
reducible, hence so is f. This proves the theorem. 

The irreducible SPN functions of n variables are a set of some in- 
terest. One question occurs immediately: How many such functions are 
there? This may be answered as follows. 

Let S, be the number of functions in a minimum generating set of 
SPN functions of n variables. Since, by Theorem 2, the correspondence 
between functions and networks is one-to-one, S, is the number of 
series-parallel networks of n elements.* Let 7, be the number of functions 
in a minimum generating set of irreducible SPN functions of n variables. 


Theorem 8 


Proof: S, is the number of series-parallel networks of n elements, and 
i, is the number of series-parallel networks of n elements which have no 
single elements in series or in parallel with the entire network. Form 
networks of n + 1 elements by adding a single element in series to each 
of the S, networks of n elements; form the corresponding set by adding 
one element in parallel. There are 2S, such n + 1 element networks. 
None of these corresponds to an irreducible function. If the networks 
corresponding to all the irreducible functions are now adjoined, giving 
2S, + tn41, this includes all the SPN functions of n + 1 variables, since 
any reducible function is included in the set of 2S, functions, and all 
irreducible functions are included in the set of 7,4, functions. Hence 
Sno = 28, + ing. It is obvious that there are 2"”' completely redue- 
ible SPN functions of n variables since they are canonical-form network 
functions, and these correspond to the odd integers m, 1 < m S 2”. 
The remaining SPN functions of n variables are each either irreducible 
or reducible to some irreducible function. Theorem 9 exhibits this divi- 
sion. 


* These numbers are discussed in Ref. 3. 
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Theorem 9 


9 


n - 
De Qn’ : . 9jn-ij-l 
S, 0 2 +i, + 2 i. 
j=1 


(a) The term 2"! represents the canonical-form network functions of 
n variables. 

(b) The term 7, represents the irreducible SPN functions of n variables. 

(c) The remaining terms 


represent the SPN functions which are reducible to irreducible functions 
of more than one variable. To prove Theorem 9, it is only necessary to 
solve® the difference equation, S,4; — 2S, = tr4,. Table I shows the 
numbers of interest for n = 1(1)10. 

Let f(v;) be a function of n variables, represented by a canonical-form 
matrix of zeros and ones (n columns, m rows), as in Section I. Let mjo 
and ma m — mio be equal to the number of zeros and ones respec- 
tively in the x; column of the matrix. 


Theorem 10 


If f(x;) 7s an SPN function, it is a function of x;’ or x; according as m 
d d d A ,] J : il 


or Mio ts the lesser. 


TABLE | 


I 0 
2 0 2 
} 0 4 
~ 10 ; 0 8S 
24 é a 16 
66 18 16 32 
180 18 68 64 
§22 162 232 128 
9 1532 {88 788 256 
10 4624 1560 2552 512 


S, = total number (taken from Ref. 3) of SPN functions in a minimum gener 
ating set. 

i» = number of irreducible SPN functions in a minimum generating set. 

r, = number of reducible but not completely reducible SPN functions in a 
minimum generating set. 

C, = number of completely reducible SPN functions in a minimum generating 
set 
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Proof: By hypothesis, f can be written as f = x2;*A © B, where (a) x;* 
is either x; or x,’; (b) A and B are functions of the remaining variables 
only. Then 


x,*A 4 
x;*(A @ B) @ 2;*'B. 


Since B < A @ B, the number of terms in B is equal to or less than the 
number of terms in A © B. Now suppose that 2,;* be equal to x2; , then 
f = 2,(A @ B) @ 2,’B. But the number of terms in B is mj, and the 
number of terms in (A @ B) expanded as a function of all n — 1 vari- 
ables is mj, , hence mj S mj. But m min + mj, is odd, hence mj 
My. 

Similarly, if 2,* x;', mj is the number of terms in (A @ B) and 
m,, is the number of terms in B, so that mj, S mj. Again, the equality 
is impossible, so that mj, < mj. This completes the proof. 

As an example, consider the series-parallel network shown in Fig. 
The corresponding function is 


- , , 
I(x) X1 (ty @D Xo Xz) 


and the canonical-form matrix for f is: 








Fig. 4 Network for f(2) 
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The counts for each element are: 


By Theorem 10, f is a function of 2’, x2’, x3 and 24, and this is clearly 
true. 

Consider any element 2; in a series-parallel network. Let A; represent 
all of that part of the network which is in parallel with x;. Let B, 
represent all of that part of the network which is in series with the 
parallel combination of A; and 2;. Let g, be equal to B,(A; @ 2;). In 
general, let A; be that part of the network which is in parallel with 
gi, and B, be that part of the network which is in series with the 
parallel combination of A; and g;,. Then an SPN function f may be 


expressed as: 


N N 
:@ > Aa IT B,,* 
l i=k 


where A; and B; are SPN functions of distinct variables (see Fig. 5); 
A, may be the trivial function 0, while By may be the trivial function 1. 
Definition 5 

x; 7s said to be a ‘‘series’”’ element if and only if Ay = 0, and x; is said to 
be a ‘‘parallel’”’ element if and only if Ay ¥ 0. 

Theorem 11 


In the circuit for an SPN function, x; is a series (parallel) element if and 
only if the lesser of mjo and mj, ts even (odd). 
Proof: Suppose that f is expressed as: 


N N N 
f=2,[]B:;® DA. IB, 
t=] kel i=k 


with A; and B; SPN functions of nonoverlapping subsets of the vari- 


* Henceforth the symbol J will be used to signify extended application of the 
operation ®. 
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ables. Then 
1,4:,,B) =][B,® A. [[B 
l k 1 i=k 
118.0 A, J] Bi; e 2) A. IB 
i=] i=l k=? i=</ 


Clearly A; is a vacuous variable in this function, and wherever A,P 
appears in the canonical form A;’P appears also (P is a product involy 

ing Ay, ---, Ay, B,, --- , By and their primes). Hence, if m,, is the 
number of terms in A; and A; is a function of S,, variables, then, by 
Theorem 4,m,, + m4,’ = 2°41 isa factorof mj; . If A; 4 0, then S,, 4 0, 
hence m,, is even and mj odd. 


Now suppose A; is identically equal to 0, then 


f(z; = 0,A;,B) = A, JB. 
k l j 
A, J] B;@ >> A. ]]B 
“LAT B, 


and clearly B, is vacuous. Hence, mg, + mz,’ 2°": is a factor of m jo 





a 


e 





am 








N N 
Fig. 5 Network for f = 2;]] B: @ > 
i J A= 
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and mj is even, mj, odd. (Note that S,, = Ss, = O implies f = 0; 
this case need not be considered.) 


The orem 12 


If two elements x; and xx are in series or in parallel, then mio equals 
Myo and Mj, equals My, . 
Proof: lf the function is expanded as 


N 


N 
>a IB 
k=l k 


’ 


and Theorem 4 is applied, the result follows immediately. 

It should be noted that the converse is not true; that is, m jo Mo 
and mj; mi, does not necessarily imply that x; and 2, are in series or 
in parallel. This will be seen in one of the examples which will be given. 


The orem 13 


One of mjo and mj, ts even; call this m. Let k be the largest integer such that 
2k m.* Then k is the number of switches in the subnetwork which is in 
series or tn parallel with x; . 


Proof: Suppose that f is a function of x; (not of z,’). This assumption is 
irrelevant in the argument, and is made only for the sake of definiteness. 
Then mj < mj. 

Case 1: Suppose that x; is a parallel element. Then m jo is odd, by Theorem 
11, hence m = mj. Clearly, in the function f(7; = 1, A;, Bi), Ai isa 
vacuous variable. Therefore, in the canonical form of f(z; = 1, A;, By) 
wherever A,’P appears so does A,P. Hence, by Theorem 4, 


ma = (ma, + ma,')C 


98a, Cc 


But C is odd (if it were even, then removing the subfunction A; would 
make x; a series element, with m jo odd, and this is impossible by Theorem 
11). Therefore 2°41 | m, and 24: ) \ m for k = 1, 2, 

Case 2: If x; is supposed a series element, then clearly m = mj. Also 
B, is vacuous in f(z; = 0, A;, B,). (Note that A; = 0 by hypoth- 


*a| b as usual means “a divides b’’; a \b means ‘‘a does not divide b.” 
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esis.) A similar argument shows that 2°":| m7, while 2°: mfor k 
= 1,2, ---. This proves the theorem. 

Theorem 14 

Consider an SPN function f. Of £ and f’, one is essentially series, say f. 
Then f may be expressed as f = a: B, where a, B are SPN functions of 


nonoverlapping subsets of the variables. If a equals a(x;), then mg | mjo 
and mg | mj; ; tf 8 equals B(x;), then Ma | Mjo and Mg | Mj; . 


Proof: If a equals a(x;) then mg | mj and mg | mj , since every term in 


the canonical form of @ (as a function of its subset of the variables) is 
repeated in the canonical form of f exactly mg times. 


Ill. SYNTHESIS 


Consider the function defined by the canonical-form matrix: 


l 
Suppose it is given that this function is an SPN function, and it is re- 
quired to find the network. 
Table II shows the counts for each variable and the information which 
is derived from these counts by application of Theorems 10 through 14. 
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TABLE I] 


Prime Factorization of 


Series or 
Variable 
Parallel 


The column headed “‘variable’’ is filled in by applying Theorem 10. 
Since in every case mj < mj, only unprimed variables occur. The 
column headed ‘series or parallel” is filled in by applying Theorem 11. 
Since myo , Me and mg are odd, 2 , 2 and 2; are parallel elements; 2; and 
xs are series elements, since my and ms are even. The column headed 
‘“‘k”’ is filled in for use in applying Theorem 13. Since 2 | 10 and 4 } 10, 
the values for 2, and x2 are 1. Also 4| 12 and 8 \ 12 givesk = 2 fora;. 
Similarly for 2, and 2; the values are 1. The last two columns give the 
prime factors of mj and mj, . Since my, = mz = 10 and each of 2, and 22 
is in parallel with a single switch and since no other mj equals 10, 2, 
and 2x2 are in parallel, i.e., (7; ® 2x2) represents a component of the net- 
work. Similarly (2, - 25) represents a component. Since x; is in parallel 
with a two-switch network but not in parallel with 2, (by Theorem 
12), it must be in parallel with x, - x5. Hence (2; @ 2x25) represents a 
component. Since x; cannot be in parallel with x,, these two components 
must be in series, therefore f (a, ® ae) (x; ® xyx;) and the net- 
work is as shown in Fig. 6. Note that if a equals (7; © 22) and 8 equals 
(x3 ® 2425), then m, = 3, mg = 5 and 


Mg | Myo , mg | My , Mg | Meo , Mg | Mo , Mg | M30 , 


Me | M3, , Ma | Mao , Ma | M4, Ma | M50 , Ma | M51 , 


as required by Theorem 14. If Theorems 10 through 14 are to be applied 
mechanically, the following fact should be noted. Once it has been deter- 





Xa pom Tae igen 




















x; X3 





Fig. 6 — Network for f = (2; ® 22) (%3 ® 2425). 
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mined that (2 ® 22) and x42’, represent components, these combinations 
may be treated as single elements; the canonical form may then be 
modified and the count repeated. For example, let A equal (2; @ 22) 
and B equal ayr;. Then the original canonical form may be rewritten 
(in two steps) as 


and this function is obviously A(z; @ B). 
Consider the canonical-form matrix: 


Xi) To 23 He LT 


s 


l 
1 
l 
l 
1 
1 
l 
1 
l 
l 
l 
1 
l 
l 
l 
l 
1 
1 
l 
l 
1 
l 
l 
l 
l 
l 
1 
] 
l 
1 
0 


emt ek et et ee et et et et ft tt tt et et et et et et et et et et et 
a eee er ee ee ee ee ee ee 
eee eee ee ee ee ee  __|) 
ea a ee ee ee es 


ee ee ee ee ee ee ee ee ee 


eee ee ee ee 


a om on 
ee 
~ 


~~ 
— 
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TABLE III 


Prime Factorization of 


Variable 


Jc 

to 
uo 
i) 


2wEe 
+O 


to we 


on 
. ~I-1 
= 
wwoa 
“~~ - 
as 


Tho 02 OD tO to 
we 


- 
to Orbe to tS DW NS 


~J 
Oe 7 a0 
“J 


This gives Table III. It is evident that 

i. one Of aye’, X23 , 2124 appears in the network; 

li. 25/a’ appears in the network; 

ill. as’ ® 2g appears in the network. 
In order to select the correct one of x,2', 21%, 7,41, Table IV is con- 
structed. Note that if zy is to be a component it must be possible to 
substitute A for ry in the canonical form. Then a count can be made 
of mao and m,4,. But A equals zero implies three terms as shown: 


A=1 A=0 


| 
ry 


0] 
10 


Therefore, the number of occurrences of zy = 00, xy = O1 and ry = 10 


must all be equal. Clearly, in the example only the combination 2,22’ 
satisfied this requirement. This implies that 2,22" and 2327, represent 


TABLE IV 
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components. Reducing the canonical form in terms of the new variables 
W" 
Y2 


U3 


gives the following matrix: 


Analysis of the counts for this matrix is shown in Table V. A count for 


Yo, Ys, Yiys Yields Table VI. Therefore 2; = 2,72 ® x30, 2 


X5'X—e @ 27, and 2; xs’ ® xy represent components. Again reducing 
the canonical-form matrix leads to 


Hence the function is 
f (ay2%9" @ 1324) (a5 26" @ Xz) (2x" @ Tg) 


and the network is as shown in Fig. 7. 

It should be noted that an attempt to apply Theorems 10 through 14 
to a non-SPN function wil! lead to a contradiction. For example, the 
function whose canonical-form matrix and bridge network are shown in 





THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1958 


TABLE V 


Prime Factorization of 
Variable S or P 


TABLE VI 


01 














eri: Sevan 


Fig. 7 Network for f = (2,22 ® 


X§ Xp 





23%) (25'26' BM Bz) (2s PB 2p) 
“ABLE VII 
Variable 


, 
v1 
, 
Te 
, 
Ts 
T4 
, 
rs 


L6 
Xz’ 


7 


Fig. 3 yields Table VII. If this function is to be an SPN function, 2,’ 
must be in parallel with a one-element subnetwork. But no other element 


exists with the number pair (34,25) as (mj, mj), and this contradicts 
Theorem 12. 


The results produced thus far by this investigation suggest many 
intriguing possibilities. For example, if a function is not an SPN fune- 
tion because it requires both x; and 2,’ 


contacts, can rules be derived for 
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augmenting the canonical form of the given function so as to produce 
the canonical form of a new function which (a) is an SPN function of 
n + 1 variables and (b) reduces to the given function when the new 
variable is replaced by 2x,’? Obviously this notion can be extended to 
cover all functions by introducing a large enough number of new vari- 
ables. That this is not a trivial question is shown by the following 
example: 
Let f(a, %2 , #3, %4) be defined by the canonical-form matrix 


Since m = 6, it is clear that this function cannot be an SPN function. 
Let us assume however, that it can be realized using five switches in- 
cluding both an 2, and an 2’, and attempt to form a function g(2 , 22, 


X3, %, a) such that (i) g isan SPN function, and (ii) g(a, 22, 43, 2%, 


a = 2x2’) equals f(x, , 2, 23, 24). We proceed as follows: 


1. Copy the canonical form matrix of f, adding a column headed a. 
This column is the inverse of the column headed 2» . 


Condition (ii) requires that these terms appear in the function g. 
2. The canonical form matrix for f’ is similarly augmented, giving 
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and condition (ii) requires that these terms appear in g’. 


3. List the remaining possible terms of g. (These will include all 
combinations for which a = 2».) 





Since these terms will vanish when a is set equal to x2’, any of them may 
appear in the function g. Assuming that g is to be an SPN function of 
primed variables only, it follows that 
(a) whenever, in the function g, the combinations 2,0zx321 or x,1x32,0 
appear, so must 2,0zx32,0, and 
(b) the combination 2123241 may appear in g only if 27,0x32,0, 2023041 
and 2,12327,0 all appear as well. 
4. By the application of these rules the terms in (4) may be divided 
into three sets: (a) those which must appear in g, (b) those which must 
appear in g’ and (c) those which these rules do not determine. For this 
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case, these sets are 


appear in g appear in g not determined 


5. Combining these with the known terms of g and g’ gives 


appear in g appear in g not determined 
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The terms presently given as appearing in g form an SPN function 
g = x22 @ 2;'x,/a' 
and 
g(z, , 2, Zs, 2a, @ = Bs) 2X3 @D Laks Xe) = (2, , La, Ls, Xa): 


However, note that if all the terms listed as “not determined” are ad- 
joined to those which “appear in g,”’ a canonical form for the SPN 
function 


; , 


OS, Ss 5 Bs, Bay O) (xy) @ a’) (ao’ @ 23'24') 


is produced, and 


g(x, , H2,2%3, Xs, Xe (a1 ® te) (xe! © 13/24" 


F(a » wes Ze, Tea}. 


From this example, it isclear that the requirement that g be anSPN fune- 
tion, with g(2 , %2, %3, 2%, 4a Xo’) = f(xy, te, %3, 4) does not specify 
g uniquely. 


IV. CONCLUSIONS 


Clearly the derivation of the circuit for an SPN function is a process 
which can be mechanized. If a method for augmenting the canonical 


form of a non-network function can be formulated, then a purely me- 
chanical process could be set up for deriving the minimum series-parallel 
networks for an arbitrary function. 
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Circular Waveguide Taper of 
Improved Design 


By HANS-GEORG UNGER 


Manuscript received August 22, 1957) 


Transitions between round waveguides of different sizes for TE trans- 
mission are required to be free of mode conversion. Conical tapers with a 
gradual change of cone angle transform cylindrical waves in the round wave- 
guide into spherical waves in the transilion region. They thus cause very 
little power conversion to spurious modes. Optimal tapers and almost opti- 
mal tapers are found. An improved taper connecting 3 in. to 2 in. Lop. 
waveguide must be 3 ft long to keep the spurious mode level below —50 db 
for frequencies up to 75 kme. A taper of constant cone angle would have to 
be 58 ft long to obtain the same spurious mode level. 


I. INTRODUCTION 


In long-distance waveguide transmission, multimode waveguides of 
large diameter must be used to exploit the low-loss properties of the 
circular electric wave. Multiplexing of a series of frequency bands into 
one pipe is, however, most easily done at certain smaller diameters. Like- 
wise, sharp intentional bends can be negotiated more easily at small 
diameters. Therefore, transitions between different diameters must be 
made quite frequently in a waveguide system. 

If these transitions are built in the form of a conical taper which 
matches the waveguide sizes of both sides, they tend to excite higher 
circular electric waves. Since no simple means are known to suppress 
higher circular electric waves without affecting the lowest circular elec- 
tric wave, mode conversion-reconversion distortion can only be avoided 
by keeping power in all the higher circular electric waves at an extremely 
low level. Transitions are therefore required which will introduce very 
little conversion to higher circular electric modes. Conical transitions 
with a constant cone angle, unless they are made very long, generally 
excite too high a level of these spurious modes. 

Wave propagation in the conical taper is most easily explained in terms 


S899 
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of normal modes. The normal modes of the circular electric wave fam- 


ily —-the only family of interest here — are, in the cylindrical guide, 


cylindrical circular waves with plane equiphase surfaces and, in the 
conical guide, spherical circular waves with spherical equiphase surfaces. 
At the junction of a cylindrical guide to a conical guide — such as occurs 
twice in a conical transition —a cylindrical wave from the cylindrical 
guide excites a series of spherical waves in the conical guide. For in- 
stance, an incident TE; wave will excite all the TE», waves, thus caus- 
ing a rather high spurious mode level. 

To avoid this mode conversion, a transition which transforms the 
cylindrical wave into the spherical wave must be introduced at the junc- 
tion. 8S. P. Morgan! has suggested and worked out the design of dielectric 
inserts placed near the junction which, acting as quasi-optical lenses, 
transform the cylindrical waves into the spherical waves. However, be- 
cause of the dispersive character of the lenses and of the waveguide, 
good broadband performance is difficult to achieve. 

Another way of making a transition from cylindrical waves to spheri- 
cal waves — at least approximately — is to taper the cone angle from 
zero at the cylindrical guide to the finite value of the conical guide. If 
this is done gradually enough, nearly all the power incident in the cylin- 
drical wave will be transformed into the spherical wave, with a very low 
spurious mode level. Although this is only an approximate solution to 
the problem, it does have good broadband characteristics. 


Il. WAVE PROPAGATION IN A TAPERED WAVEGUIDE 


In a tapered circular waveguide such as the one shown in Fig. 1, the 
radius a is related to the distance z along the taper by a function which 
is assumed only to be smooth and to have a zero first derivative at both 
ends. 














Fig. 1 — Tapered waveguide transition. 
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The field excited in any cross section by an incident TE wave can 
be expressed as the sum of the TE, waves of a cylindrical guide having 
the same radius as that of the cross section. With this representation, 
the tapered waveguide appears to be an infinite set of mutually coupled 
transmission lines, each line representing one of the cylindrical TKon 
waves. The wave propagation is described by an infinite set of first- 
order differential equations. If we assume the taper to be gradual enough, 
the power in all TE, terms with m greater than 1 will be small compared 
to the power in the TE term. We may then consider only coupling 
between TE»; and one of the TE», terms at a time. Furthermore, we 
need consider only the forward travelling waves, since the relative power 
coupled from the forward waves to the backward waves is quite small. 
Thus, the infinite system is reduced to the well-known coupled line 
equations? 

GA —78,;A; + ky Ao, eA —7BxAo + kA, (1) 
dz ; dz : 
in which A,» are the complex amplitudes of the cylindrical TE: and 
TEom respectively, 8:2 are the phase constants of these cylindrical modes 
and ke , ky are the coupling coefficients. The coupling coefficients are cal- 
culated in the Appendix by converting Maxwell’s equations into general- 
ized telegraphist’s equations. Both the phase constants and the cou- 
pling coefficients are functions of the distance along the taper. Power 


conservation requires 


Iie 


kyako —k* 


Coupled line equations with varying coupling coefficient and varying 
phase constant have been treated by Louisell.* 

In order to solve (1), local normal modes must be introduced. Normal 
modes are waveforms in a uniform waveguide which propagate without 


change of field configuration or, in terms of the coupled line description, 


do not couple mutually. Analogously, local normal modes in a non- 
uniform waveguide are waveforms of a local cross section which would 
propagate without change of field configuration in a waveguide which is 
uniform with respect to that local cross section. For example, local nor- 
mal modes in the waveguide taper are the spherical waves of a conical 
waveguide with a cone angle corresponding to the local cone angle in 
the taper. They would propagate in the conical waveguide without mu- 
tual coupling. 
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Yor the system of equations (1) local normal modes are defined by 


A = (W, cos ; t _ W> sin ; :) exp (-1 | B iz) ; 
- (Ww, sin 5 + W, cos : :) exp (-) [ 3 iz), 


k k 
B=-(i+h), tang =2 =2—. (4) 
2 Bi — Bp ~ dB 


(3) 


in which 


Upon substituting (8) into equations (1), the local normal modes must 

satisfy 
dW, 
dz 


IE a, i ee bin 
i > —jTW2= —--—W,i, (5) 


bi call leat > tad dz 


where 
T(z) = VAs? + 4k. 
Equations (5) are coupled only through the terms proportional to dé /dz. 


If £ is constant, they reduce to uncoupled equations for W, and W,. 
For a gentle change in taper angle, with 
1 dé 
- | ee : (6) 
2r dz 
approximate solutions of (5) which proceed essentially in powers of 


dt/dz can be written down: 


dé 2 jp(2’ 


dz’ 


Wi(z) =e *" |W.) + 2 W.(0) 


€ 
0 dz’ 


his at son f 
— | W.0) | 


é € 
Jo dz’ J0 dz” 


Welz) = | W200) _ ; w,0) [ ete? ae’ 


- € 
/0 dz’ 


es W.(0) 3 dg ” peat [ dg "pees a'|, 


dz’ Jo dz” 


in which 


p(z) = . T'(2’) dz’. 
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The initial conditions in the taper are A,(0) = 1 and A2(0) 0. The 
taper begins with zero cone angle; hence, from (3), 


W,(0) = 1 


in which W, corresponds to the TE, wave and is the wanted local nor- 
mal mode, while W>. corresponds to one of the TE», waves with n + 1 
and is an unwanted mode. At the end z of the taper, the unwanted mode 


amplitude is 


(9) 


At the taper end the cone angle is again zero, (£(2;) 0). Therefore 
W2(z:) | equals | Ao(z,) |. Equation (9) integrated by parts becomes 


r= i 
W 2(2) = wz ies dz r (10) 
The mode conversion in a smooth but otherwise arbitrary taper can be 
calculated with (9) or (10). 


Ill. DESIGN OF A TAPER 


A waveguide taper can always be built to have as low a mode conver- 
sion as is wanted in a certain frequency band merely by making it long 
enough. However, an optimally designed taper has the smallest possible 
length for a given difference in diameters at its two ends and for a spee- 
ified unwanted mode level in a given frequency band. 

Some analogies between this problem and the problem of a trans- 
mission line taper of optimum design are evident. The transmission 


line taper for matching impedances is nothing but a tapered waveguide 


in which only one mode is propagating. Power can only be converted 
into reflected waves, and it is this reflected power which is kept small in 
a properly designed transmission line taper. If more than one mode is 
propagating, power will be scattered not only into the reflected wave 
but also into the other propagating modes. In fact, the power scattered 
into backward traveling waves is quite small compared to the power 
scattered into forward traveling waves, and only the latter need be 
considered in a multimode waveguide taper. Therefore, the mode con- 
version in the waveguide transition corresponds to the reflection in the 
transmission line taper. 

It has been shown that a transition between transmission lines of dif- 
ferent characteristic impedances is optimally made by a series of steps 
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spaced about a quarter wavelength apart.‘ The magnitude of these steps 
is chosen to give an input reflection described in its frequency dependence 
by a Tschebycheff polynomial. Similarly, a conical waveguide transi- 
tion is expected to perform optimally when it is composed of a number 
of sections with different cone angles, as in Fig. 2. The lengths of these 
sections should be chosen so that the converted energy from adjacent 
joints of sections is 180 degrees out of phase, and the changes of angles 
from section to section should be chosen so that the over-all conversion 
pattern corresponds to a Tschebycheff polynomial. However, since there 


is more than one mode to which power is converted and since the phase 


constants change along the transition, the design of such a multisection 
transition will be very difficult, if not impossible. 

If the number of sections in the line transition is allowed to in- 
crease indefinitely for a fixed over-all length, a continuous transmission 
line taper is formed. The results of the multistep transition of optimum 
design have been extended to this case.° The input reflection of the 
continuous taper of optimum design is described in its frequency de- 
pendence by a Tschebycheff polynomial of infinite degree. The taper 
curve itself is given essentially by the Fourier transform of this Tscheby- 
cheff polynomial. The frequency band of constant low reflection now 
extends from a certain lowest frequency to infinity. 

The transmission line taper of optimum design does not have a con- 
tinuous reflection distribution, but it has reflection impulses at both ends, 
Assuming a completely continuous reflection distribution, an almost 
optimum design has been found.® The raised cosine function is a reflec- 
tion distribution which, in practical cases, closely approximates in its 
frequency dependence the input reflection of the continuous taper of 
almost optimum design. 

If the integration in (9) for the mode conversion in the waveguide 


pen 
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Fig. 2 — Multisection waveguide transition. 
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taper is extended over the parameter p, 


ss 
Ws(py) E(p)e 7” dp ‘ (11) 
Jo 
the relationship of the mode conversion in the multimode waveguide 
taper to the input reflection of the transmission line taper becomes 
evident. In (13) p corresponds to the longitudinal coordinate and &(p 
to the reflection distribution of the transmission line taper. 

The reflection distribution for an almost optimally designed line taper 
can be substituted for &(p) to give the almost optimally designed wave- 
guide taper. Even the optimum design can be found when some care is 
exercised in using steps in the waveguide taper to simulate the impulse 
functions in the reflection distribution. 

Once &(p) is given, we can find the waveguide radius a(p) and the axial 
distance 2(p) from (4), (5) and (8). The coupling coefficient can be written 

da da 


cla) T c(a) ; (12) 
dz ap 


Therefore, 
da tan &(p) : 
cla) (13) 


dp VY1+ tan’ E(p) ¥ 


which can readily be integrated by separation of variables to give a(p). 
With a known, T is also known as a function of p, and the axial distance 
can be calculated: 


-p 


dp 
| —— . (14) 
“0 I'(p) 


Some restricting remarks must be made in concluding this outline of 
the taper design. 

Only one of the coupled higher-order modes has’ been considered here. 
This is adequate as long as the cone angle changes only gradually. Since 
all other higher order modes are weaker in coupling and farther removed 
in phase constant, the conversion to these modes will always be much 
smaller than the conversion to the TE wave. If, however, there are 
abrupt changes in cone angle or even if the diameter of the guide changes 
in steps, as in the waveguide taper of optimum design, the power con- 
version to higher modes must be checked. 

The design procedure outlined here works only as long as the coupled 
mode does not go through cutoff within the transition. Power converted 
to backward travelling waves has been neglected, since the coupling 
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to these waves is smaller and they are much more removed in phase 
constant than are the forward travelling waves. In the vicinity of cutoff, 
this is no longer true. Here, the propagation is more suitably described 
by the original generalized telegraphist’s equations (35) and (36) of the 
Appendix. They are, however, not easily solved in the cutoff region of a 
mode. 

Since the coupling coefficients in (35) and (36) do not change with 
frequency, and since the difference in propagation constants of the 
coupled waves does not increase with decreasing frequency, the power 
conversion does not increase when the frequency is lowered to bring a 
coupled mode to cutoff within the taper. Consequently, the waveguide 
taper can be designed for a frequency high enough to keep the lowest 
coupled mode propagating throughout the taper. It will then work pro- 
perly at all lower frequencies 


IV. THE RAISED COSINE TAPER 
As a characteristic example, we will design a taper whose conversion 
distribution follows the raised cosine function 
(15) 
Pi 


in which C is yet to be determined from the radii a; and a2 . According to 
(11) the over-all conversion will then be given by the Fourier transform 
of the raised cosine function 


W2(p1) 


Equation (16) is plotted in Fig. 3. 
We assume the TE»; and TE: modes to be far enough from cutoff so 


that 
(\) «K 1 (17) 
Ne 


for both modes throughout the taper. Furthermore, we assume the taper 
to be gentle enough so that 


P< i. (18) 


These assumptions will enable us to carry out the integrations analyti- 


cally. 
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Fig. 3 Mode conversion in the raised cosine taper 


Integrating (13) with 


from (42) and (12)| and &(p) from (15) we get 
c\p 5 


a» | eee 
In r(¢ 57 sin a), (19) 
a \P1 <7 Pl 


where C has been eliminated by the boundary conditions a(0) = a; and 


a(pi) = a2: 
(20) 


Because of (17) and (18) 
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The axial distance is found from (14) by substituting (19) for a in (21): 


4B; f? as re 
a | exp | 2 In — ew « dp, 22) 
1”? — ky? Jo 4 \pi 20 T pi 


where {> is the phase constant of free space. With a 1/ In aeo/a, and 
x = 2n(p/pi), the integrand can be expanded in a series 


28 2 ex 2 
“PoQ) pi ar : - 6 9 : ao 
~ ~ | e“(1—asinz += sin « ¥ ---)dzx (23) 
7 | k2 — k;?? ) Jo 2 


and, for moderate values of a, the axial distance z can be calculated by 
term-by-term integration. The total length of the taper is approximately 


2 ‘ 2 M2 2 Q2 
28 P oj x +2 1ln— In 
«Vo = ay ay 
+ ‘ 


~ ‘ e Qo o .« 
wr? + In? tor? + In? 
ay, ( 


(24) 


ly 


ly 


A numerical example will show the advantage of a properly tapered 


waveguide transition. Suppose a transition designed for the TE wave 
and connecting j in. 1.pD. pipe to 2 in. 1.D. pipe is required to have a spuri- 
ous mode level of less than —50 db for all frequencies up to 75 kme. 
The most seriously coupled wave is TE. For a raised cosine taper we 
get, from (20), C equal to 1.28 and consequently, from Fig. 3, p: equal 
to 14.8. Hence, from (24), the required taper length is 2, equal to 3 ft. 
The taper curve is plotted in Fig. 4. This curve has been calculated from 
(19) and (23), using p as a parameter. A taper of constant cone angle 
satisfying the same requirements would have to be 58 ft. long." 





36.2" 


Raised cosine taper. 
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Circular waveguide tapers have been built according to Fig. 4. Their 
TEom-TE2 conversion loss was measured at frequencies near 55 kme and 
found to be higher than 50 db. 


V. SUMMARY 


The power conversion from TE» to higher circular electric waves oc 
curring in conical transitions of round waveguide can be reduced by 
changing the cone angle gradually rather than abruptly. Instead of a 
taper with constant cone angle, a transition with changing cone angle, 
which transforms the cylindrical waves of the round waveguide into 
spherical waves in the transition region, is suggested. 

Upon converting Maxwell’s equations into generalized telegraphist’s 
equations, the transition is represented as a set of nonuniform transmis- 
sion lines, nonuniformly coupled. With proper choice of the coupling 
distribution — and hence the cone angle — as a function of the distance 
along the transition, we can find an optimum design which minimizes 
the length of the transition for a specified frequency range and spurious 
mode level. In a transition of optimum design the mode conversion is 
given by a Tschebycheff polynomial of infinite degree in its frequency 
dependence, and the geometry of the transition is found from the Fourier 
transform of this Tschebycheff polynomial. A simpler design, but still a 
good one, has a transition geometry given essentially by a raised cosine 


function. 
APPENDIX 


Generalized Telegraphist’s Equations of the Circular Electric Waveguide 
Taper 


A very convenient mathematical formulation of the electromagnetic 
problem in the waveguide transition is provided by 8. A. Schelkunoff’s 
generalized telegraphist’s equations for waveguides.’ Maxwell’s equa- 
tions for transverse electric and circular symmetric waves are, in cylin- 
drical coordinates: 

Ok, 


Oz 


lawl, ; 


oH. 


or 


Jue BE, + 


= jourl ’ 
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where £, , H, and H, are the only nonvanishing components of the field, 
e is the dielectric permittivity, « the magnetic permeability and w the 
angular pulsation. The exponential dependence of time is understood. 

The boundary conditions of the waveguide taper are, at r equal to a: 
E, = 0 (28) 


¥ 


H, =H. (29) 
dz 


The field at any cross section of the paper is represented as a super- 
position of the fields of the normal modes in a cylindrical guide of the 


same cross section: 
, 
1s(1s2) 
7 . ( 
E,= >> 1 


i m : V ‘aed o( Km ) : 


m 


m Vv ad o( ky, ) 


Js (kn?) 
H. x 2 a a 


J) (i r) 
H,=->d JI 


, 


m V rad ol km) 


where Jo and J; are Bessel functions of the first kind and k,, is the mth 
zero of J;. The V» , Zm and 7, have the dimensions of voltages and cur- 
rents. The factors of V,, and /,, are normalized so that P,, = 1/2(VmJm*) 
is the complex power flow in each normal mode. It has to be kept in 
mind that a is a function of z. 

The boundary condition (28) is satisfied by the individual terms of the 
series for E, . Hence, this series converges uniformly. Not so the series 
for H,: (31) is a representation for H, only in the open interval 
0 < r <a, since the individual terms vanish at r = a but, according to 
(29), H, does not. In the closed interval, the series (31) represents a dis- 
continuous function and therefore does not converge uniformly, Term- 
by-term differentiation will make the series diverge. 

The relationship between 7,, and V,, is found by substituting in (27) 
the series (30) for E, and the series (32) for H, , and comparing coef- 
ficients 


k m 


wpa 
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Using this relation and substituting the series (30) for 2, in (25) at r 
we find that the boundary condition (29) is satisfied. 

To convert Maxwell’s equations into generalized telegraphist’s equa- 
tions, we introduce (30) and (31) into (25) and (26), multiply both sides 
of both equations by J,|k,(r/a)| and integrate over the cross section. 
Since the series for H, does not converge uniformly, we write for the left- 
hand side of (26) 


m (i. 4 nk  ? "| one E (i 4 u,| «a 
a Oz Oz a Oz a | 


and invert integration and differentiation in the second term of this 
expression. 


The generalized telegraphist’s equations have the following form: 


dV, | da 2kak, . 
—jwpl, + i V.. 
dz a a dz p> ky? — k,? 


dl, : ae 1 da 2k 


—} a3 


dz " wy "  ad2z “SX ky — ke? 


The summations are extended over all m except m n. The quantity 
8, is the phase constant of the nth mode in a cylindrical guide of the 
particular cross section; 8, is a funetion of a and therefore of 2. 

The generalized telegraphist’s equations represent. an infinite set of 
coupled nonuniform transmission lines. For our purpose, it is convenient 
to write the transmission-line equations not in terms of currents and 
voltages, but in terms of the amplitudes of forward and backward travel- 
ling waves. Thus, let A and B be the amplitudes of the forward and back- 
ward waves of a typical mode at a certain cross section. The mode cur- 
rent and voltages are related to the wave amplitudes by 


V = WK (A + B), 


where K is the wave impedance 


, yu . 
5. = =o, (39) 
Bn 


If the currents and voltages in the generalized telegraphist’s equations 
(35) and (36) are represented in terms of the traveling-wave amplitudes, 
after some obvious additions and subtractions, the following equations 
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for coupled traveling waves are obtained: 


. = —jB; A eee K : B, + he (ee A m + k, - 
dz fe 


™ 


] . ay 
( B, 78, B, ‘4 K 1 


- A, + ee oe eo A 
dz 2 Ra 2 . . ;' 


The k’s are coupling coefficients defined by: 


Knkm K K ) | da 
k, ok : . n . , — , 
Km? — ky? ( | Phim ” 4 K,/ adz 


dk, 
dz ; 


K,, 


For a cylindrical guide, da/dz is equal to zero and (40) and (41) reduce 
to uncoupled transmission-line equations. 
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The Nonuniform Transmission Line 
as a Broadband Termination 


By IRA JACOBS 


Manuscript received December 17, 1957 


The problem of obtaining a broadband microwave termination is con- 
sidered from the point of view of nonuniform transmission line theory. At- 
tention ts restricted to lines in which only the distributed shunt admittanes 
may be varied. An optimization argument is presented which leads to the 
consideration of a line in which the fractional increase in admittance per 
wavelength in the line is constant. The nonuniform transmission line equa- 
tions are solved exactly for this case, and the results are expressed in terms 
of readily interpretable elementary functions. It is shown that a fixed geo- 
metrical length of line can lead to an arbitrarily large effective length without 
destroying the match at the input. The introduction of a small loss term 
makes the line almost totally absorbing regardless of its termination. 

The line has a long-wavelength cutoff given by 4x times the actual length 
of the line. If the line is short-circuited at its far end, a return loss of greater 
than 11.4 db is obtained at all frequencies above twice the cutoff frequency. 
The effect of certain practical limitations on the performance of this line is 
also discussed. 


I. INTRODUCTION 


Classical transmission line analysis leads to the propagation of a wave 
in which neither the electric nor magnetic fields have components in the 
direction of propagation.' These transverse electromagnetic (TEM) 
waves are characteristic not only of the usual transmission line structures 
such as parallel wires and coaxial cable; they are also characteristic of 
plane-wave propagation in isotropic media. 

It is often convenient, when dealing with TEM-wave propagation, to 
make use of results of classical transmission line analysis. Some care 
must be exercised, however, in applying these results at microwave fre- 
quencies. Consider, for example, the problem of terminating a lossless 
line. Classical analysis tells us that, if the line is terminated in its charac- 
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teristic impedance (which is a pure resistance for a lossless line), the 
termination will be reflectionless. Two difficulties arise at microwave 
frequencies, where the physical dimensions are no longer small com- 
pared to the wavelength. First, as the frequency increases, the concept 
of a lumped circuit element becomes less meaningful. For example, a 
resistive disc termination for a coaxial line will have an effective im- 
pedance which is strongly influenced by the geometry and has very little 
correlation with the de resistance of the dise.2 The second difficulty is 
that, even if the appropriate effective lumped impedance is obtained, 
the analysis assumes that this impedance is connected across an open 
circuit. An open-ended line at high frequencies is by no means an elec- 
trical open circuit. There is a true open circuit, however, one-quarter 
wavelength in front of a short circuit. Therefore, if the line is short- 
circuited by a metallic surface, and the appropriate characteristic im- 
pedance is placed one-quarter wavelength in front of the short, the 
termination will be reflectionless. However, if the frequency is changed 
the quarter-wave condition is destroyed, so that the termination is not 
broadband. 

The purpose of this paper is to consider analytically the use of a non- 
uniform transmission line as a broadband termination. If this structure 
is to be finite, it too must be terminated. To make the results independ- 
ent of what is beyond the nonuniform line and still maintain physical 
realizability, it will be assumed that the nonuniform line is terminated 
by a short circuit. The problem, then, is to match from a given charac- 
teristic impedance, Zo, to a short circuit. by means of an appropriate 
nonuniform transmission line. 

There is extensive literature® on the use of nonuniform transmission 


lines for impedance matching. Optimum matching procedures have been 


discussed‘ for matching two uniform lossless lines to one another by 
means of a lossless nonuniform line. In addition to the assumption of no 
loss, it is assumed that the magnitude of the reflection coefficient is 
much less than unity at all points along the line. Both of these assump- 
tions, however, are not applicable for matching to a short circuit. Since 
the object here is to absorb all incident energy, the line cannot be loss- 
less. Also, the magnitude of the reflection coefficient is unity at the short 
circuit. Thus, the matched-termination problem must be considered 
apart from the usual matching problem. 

In order to define the problem more precisely, it is necessary to con- 
sider the transmission line equations which determine the voltage V 
and current J along the line: 





NONUNIFORM TRANSMISSION LINE 
dV 
dx 


dl 
dx 


-Z(zx)l, 


—Y(x)V, (2) 


where Z(x) is the distributed series impedance per unit length and Y (xr) 
is the distributed shunt admittance per unit length. The line is said to 
be uniform if both these quantities are constant. 

A uniform line may be used to match from an impedance of Z» to a 
short circuit if Z and Y are both large complex constants (so that the 
wavelength in the line is small and the losses are large) and if, in addi- 
tion, VY Z/Y equals Z). Thus, in principle, a fixed length of uniform 
line can be made electrically long and extremely lossy, and yet it may 
have the same characteristic impedance Z» as does the line it is desired 
to terminate. In practice, however, this may be difficult to accomplish. 

Consider, for example, the problem of finding a microwave-absorbing 
material for anechoic chambers. Using the above principle it would be 
necessary to find a material with large complex relative permeability 
and relative dielectric constant, but such that the ratio of these two 
quantities was unity. It is not too difficult to obtain high dielectric con- 
stants at microwave frequencies,® but it is difficult in general to obtain 


equally large permeabilities.® It is of interest, then, to consider a medium 


with a permeability equal to that of free space, and to attempt to match 
this medium to a short circuit by increasing the dielectric constant as 
the termination is approached. In the analogous transmission line prob- 
lem, the distributed series impedance, Z(.), is a fixed constant, and only 
the distributed shunt admittance, Y(x), is at our disposal. 

The following problem is thus suggested. A uniform lossless line, 
characterized by a distributed series impedance per unit length of jwl, 
and a distributed shunt admittance per unit length of jwC,, is to be 
terminated by a nonuniform line of length s (see Fig. 1). The nonuniform 
line has a constant distributed series impedance per unit length of jwl, , 
but the distributed shunt admittance per unit length, Y (x), is as yet 
unspecified. This line is, in turn, terminated in a short circuit at x = s 
it is desired to find that function Y(2) which minimizes the reflection 
coefficient at « = 0 

The general variational problem is outside the scope of the present 
work. In the following sections a somewhat intuitive argument will be 
presented which leads to a particular form for Y(x). An exact solution 
of the transmission line equations will be obtained for this particular 
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nonuniform line, and it will be shown that the line indeed has the prop- 
erties indicated by the simpler intuitive arguments. 


Il, DETERMINATION OF THE ADMITTANCE VARIATION 


Approximation techniques’ indicate that, if the fractional change in 
the properties of the line per “local wavelength over 27’’* is small, then 
reflections may be assumed to be negligible. The exact solutions of par- 
ticular nonuniform transmission lines, such as the exponential line,’ in- 
dicate that reflections become important when this condition is violated. 

Since we are attempting to match from an admittance level jwC, to 
an infinite admittance, it is natural to ask the following question: How 
large can the admittance be made at x s, subject to the conditions 








TERMINATING § 








Fig. 1 — Nonuniform transmission line termination. 


that it be equal to jwC; at x = O and that the fractional change in ad- 
mittance per local wavelength in the line be small? 


The wavelength in the uniform lossless line characterized by L,; and 
Cis 


| 


2r 


d = 
Ww Vv LC; 


The local wavelength in the nonuniform line is given by \/+/ ¢(), where 


Y(x) _ C(x) wl jG@) 


me, ©; wl,’ 


e(x) = 


(4) 
in which C(x) is the distributed shunt capacitance per unit length and 
G(x) is the distributed shunt conductance per unit length. For the pur- 


* “Local wavelength” is defined as the wavelength in a uniform line which has 
the same distributed constants as the line in question at the point in question. 
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pose of the present section, it will be assumed that the nonuniform line 


is lossless, so that G(x) is zero and ¢(x) is real. The condition that the 
fractional change in admittance per local wavelength over 27 be small 


can then be written 


rX 1 de 


2Qr &? dx 


(5) 


where @ is an as yet unspecified small constant. 

The problem to be considered is thus: Given s greater than zero, find 
that function e(2) which maximizes e(s), subject to the conditions that 
«(0) equals 1 and that the inequality (5) is satisfied over the interval 
(0, s]. The solution to this problem is obtained by replacing the in- 
equality in (5) by an equality, which leads to the result 


1 ° 
da) ai} = r) _ (6) 
7, 


Thus, as « approaches \/za, €(x) becomes infinite. It would then appear 
that, in a fixed length of line, an arbitrarily large change in admittance 
can be utilized, and consequently a large effective length obtained, 
without violating the slowly varying condition. The introduction of a 
small imaginary component (shunt conductance) to e(2) should then 
make the line totally absorbing regardless of termination. 

In order to verify the above conjectures analytically it is necessary 
to solve the transmission line equations. This will be done in the follow- 


ing section. 


Ill. SOLUTION OF THE TRANSMISSION LINE EQUATIONS 


For a line with uniformly distributed series impedance per unit length 
jwL, and distributed shunt admittance per unit length jo( (x), the 
transmission line equations may be rewritten in the form 


dV 2r\" 
x)V =0 
dz? t (2 ) . : 


d'I 1 de dl 2n\" ; 
—- —-——-- xl 0, 
dx? ¢ dxdx r ( d ) - 


where X is given by (3). 

If e(x) is of the form (A + Bx)", (7) may be transformed? into Bessel’s 
equation of order 1/(n + 2). The fact that considerable simplifications 
result when n equals —2 has been noted previously," but the physical 
implications do not appear to have been discussed. 
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The expression for e(x), given by (6), is real. Since we are interested 
in absorption, the shunt admittance must contain a conductance term 
in addition to the capacitance, and consequently « must be complex. In 
order to maintain the same functional dependance for ¢, it will be as- 
sumed that the ratio of shunt conductance to shunt capacitance is con- 
stant, 


G(x) 


= (9) 
C(x) 


so that e(2) is given by 


If the change of variables 


is made, (7) and (8) can be rewritten 


OF he a Re ee (12) 
a- 


dr? dr 


a fs... 
<—- ——-+—(1 — jo/u)l = 0, (13) 
dr? dr a 
where the expression for € as given by (10) has been used. The solutions 
of these equations are 
V =e "Ve = + Vee] (14) 


and 


( 


Aa fz 
f=j-v; 
where 
5 (1 — jo/w) (16) 


may be interpreted as an effective propagation constant and r may be 
interpreted as an effective length. 

The lossless line (¢ = 0) is cut off when the effective propagation con- 
stant becomes real; that is, when a 2 4. If the shunt capacitance, C(x), 
is to be frequency-independent, it follows from the form of ¢(x) that a 





NONUNIFORM TRANSMISSION LINE 919 


must be proportional to wavelength. A cutoff wavelength X» and cutoff 
frequency wy may then be defined such that 


Re ON = tip 
do Ww , 


Thus, y can be written as 


I w ‘ Ww ' 
, : (5) V ; ” wo" T Jo wo. (18) 


It is interesting to note that these results are identical in form to the 
results that would be obtained for an exponential line,’ in which the ad- 
mittance and impedance vary as e“ and e~“’ respectively, and where r 
is defined as ax. However, in order to obtain the same effective length, 
R, the actual length of the exponential line must be longer by a factor 
R/(1 — e”), which increases linearly with R for large effective lengths. 
Thus, the equivalent exponential line will, in general, be considerably 
longer than the line considered here. 


IV. REFLECTION COEFFICIENT 


If the boundary condition, V Oat x s, is substituted into (14), 
the following relation is obtained for the input admittance: 


I(x = 0) C/I 
onaii , 9) 
Vir = 0) J - 4 (3 + 1 coth vit), ( 


where FP is the value of r corresponding to « = s; that is, 


Ke = -In(1 -- i) 
do 


The complex voltage reflection coefficient, p, is given by 


Vv C; Ly - i # 
VO/L + r; 





Substitution of (19) into (21) gives 


I+ 7 = 2i7 *° coth yR 
WwW 


Ww 





) ) 
- WO 9: 0 


— 27 — eoth yR 


w w 
For the reflection coefficient to be small over a broad frequency band, 


it is necessary that the transmission loss be large; that is, the real part 
of yR must be large. For any nonzero loss, ¢ > 0, one can, in principle, 
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Y (xX) =jJwC(x) + G(x) 
C(o) = C, 
G(x)/C(x) =o 


Wo ~ =: 
es TL,Cy 





bas 





Fig. 2 — Admittance ratio required for 13-db transmission loss. 


make the effective length of the line, R, sufficiently large so that the 
transmission loss is as large as desired. It follows from (20) that this 


ean be accomplished in a line whose length is 
nN i 
s=— ¢ —e wy 
Ar 
Thus, if the physical length and the desired effective length of the line 
are specified, the cutoff wavelength can be determined from (23). For 
large R, the cutoff wavelength is essentially given by 47s. 
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A large value of R is obtained by approaching the singularity in e(). 
The real part of e(x), at = s, is given by 


(24) 


so that exceedingly large values of distributed shunt capacitance are re- 
quired to obtain moderately large effective lengths. 

The above formalism may be used to calculate the distributed shunt 
capacitance at « = s required to ensure a given transmission loss. This 
result will, of course, depend on the value of the loss parameter, o/wo . 
As an example, in Fig. 2 the value of C(s)/C, required to ensure a mini- 
mum transmission loss of 13 db for all w = wy is shown as a function 
of a/wo .* If o/wo is very much less than one, an astronomically large 
value of C(s)/C, is required. Although this may not be realizable in prae- 
tice, it gives an upper limit to the ideal behavior of the line. 

If the transmission loss is sufficiently large, 


coth yR = 1. (25) 


If, in addition, 7/wo is neglected in comparison to unity, it follows from 
(18) and (22) that 


(26) 





L+vVi1 — we’ 

Equation (26) gives the intensity reflection coefficient of the ideal line. 
The return loss (— 10 logio | p |*) is plotted as a function of frequency 
in Fig. 3. The return loss is zero at the cutoff frequency, but increases 
rapidly as the frequency is increased. f 

As a more practical example, the return loss will also be calculated for 
a/wo = 2. It follows from Fig. 2 that this requires C(s)/C, to be 400 to 
ensure a 13-db transmission loss. This choice, in addition to being physi- 
cally reasonable, leads to some computational simplifications. It follows 
from (18) that, if ¢/wo is 2, then 


l -@ 
fe 1+ 5%), 


* The 13-db transmission loss requirement is equivalent to | e~?”* | = 0.05. It 
can be shown from (18) that the real part of y has a minimum, a» , given by the 
smaller of Vo/S8w) and o/4wy . These two results, together with (24), were used 
to obtain Fig. 2. 

t In Section II the parameter a was introduced, and it was assumed that re 
flections would be small if a were small. It has since been shown [in (17)] that 
a = 4/(w/wo). Fig. 3 then gives a quantitative demonstration of the initial sup 
position. For a = 4, there is total reflection. As a decreases (w/w» increases), the 
reflection is seen to decrease rapidly. 
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3.0 


w/w, 


Fig. 3 Return loss of the ‘‘ideal line’’. 


so that the effective attenuation constant (real part of y) is frequency- 
independent. The effective length of this line is R = 3. For these values 
of y and R, coth yR varies between 0.9 and 1.1 as the frequency is 
changed. If, as before, it is assumed that coth yR = 1 (thus assuming 
R > 1, which is equivalent to neglecting interference effects due to 
multiple reflections), the reflection coefficient of the line is given by 
|p| = : (28) 
l + (w an)? gs 
If this approximation is not made, the exact expression for | p |? is more 
complicated but still easily amenable to numerical evaluation. The solid 
curve in Fig. 4 gives the return loss as a function of frequency, as eval- 
uated from the exact expression. The dashed curve gives the return loss, 
neglecting interference effects, as determined from (28). In the frequency 
range depicted, the exact return loss is seen to oscillate about the value 
obtained when interference effects are neglected. However, in the high- 
frequency limit (w >> wo) the two curves diverge. The return loss in- 
creases without limit if the effective length of the line is infinite. How- 
ever, for R = 3, the return loss at high frequencies approaches 26 db, 
which is just the two-way transmission loss of the line. 
It is seen from Fig. 4 that there is a 3-db return loss at the cutoff fre- 


quency of the “‘ideal line.’”” However, as the frequency increases, the loss 


increases only gradually; at w = 49 the return loss is 13.3, db as com- 
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RETURN LOSS IN DECIBELS 











3.0 


W/W 
Fig. 4 Return loss of a ‘‘practical line”’ 
pared to 17.8 db for the ideal line. Thus, practical limitations on maxi- 
mum shunt capacitance have an appreciable influence on the behavior 
of the line. 
V. CONCLUSIONS 


Properties of nonuniform transmission lines have led to the considera- 


tion of a line in which the fractional change in shunt admittance per 
wavelength in the line is constant. The transmission line equations have 
been solved exactly for this case. The solution indicates that a fixed 


length of line s can be made to have as large an effective length as de- 
sired. Hence, with the introduction of a small loss term, all energy 
matched into the line is essentially completely absorbed regardless of 
the line’s termination. 

It has been shown that the line has a long wavelength cutoff given es- 
sentially by Ao = 4s. As the frequency increases beyond cutoff fre- 
quency wo, the reflected intensity from the short-circuited line dimin- 
ishes rapidly, being 11.4 db down at 2w» and 17.8 db down at 4a», as 
seen from Fig. 3. 

If practical considerations limit the maximum shunt capacitance, it is 
necessary to use a larger shunt conductance to obtain the same trans- 
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mission loss. This degrades the performance of the line somewhat, as 
shown by the example in Fig. 4. If a return loss of 10 db is required, it 
is seen from Fig. 4 that the line must be operated at frequencies above 
3.1 wo. Thus, the length of the line would be s = \o/4xr 0.25, where 
\ is the longest wavelength for which the return loss would be equal or 
greater than 10 db. This length is one-third of that which would be re- 
quired with the equivalent exponential line. 

The ‘ideal line’, as indicated by Fig. 3, gives a 10-db return loss at 
1.75 wo. The length of line required for a 10-db absorption would then 
be 0.144. The differences between the ideal structure and the practical 
example become even more pronounced as greater absorption is required. 
However, practical structures may approach the performance of the 
ideal line if one considers a variation of the loss term o in addition to the 
variation C(x). 

The nonuniform transmission line analyzed here may be considered 
to be a singularity of the general Bessel line. It is of analytic interest be- 
cause the solutions are in the form of readily interpretable elementary 
functions. It is also of, physical interest because the particular varia- 
tion of line parameters is suggested by a common approximation pro- 


cedure for analyzing nonuniform transmission lines, and because the 


exact solutions indicate that the line has desirable properties as a broad- 
band termination. 
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Using Contact Resistance to Measure 
Adsorption of Gases on Metals 


By P. KISLIUK 


Manuscript received December 5, 1957) 


The contact resistance between electrodes of molybdenum and tungsten is 
used to investigate the adsorption of nitrogen and oxygen. It is found that, 
on clean metals, the initial period of fast adsorption gives rise to a surface 
resistivity of about 4 X 10- ohm-cm?, corre sponding to a layer of the 
order of 1.6 Angstroms thick on each surface. To remove, within a minute 
or 80, the layers resulting from adsorption beyond this initial period, tem- 
peratures near 1700°K are required for all four metal-gas combinations. 

Some of the oxygen adsorbed at a pressure of a few torr* at room tempera- 
ture can be removed by lowering the pressure, but such an effect was not 
observed for nitrogen. Nitrogen or oxygen adsorbed during the initial period 
of fast adsorption on clean molybdenum or tungsten at a temperature of 
100°K is not removed in times of the order of one hour by lowering th 
pressure, in disagreement with some results in the literature. The rate of 
migration of adsorbed oxygen into and out of the region between closed 
contacts is found to be much lower than that reported in an earlier experi- 
ment of the same type. 


I. INTRODUCTION 


Measurement of the contact resistance between metal electrodes can 
be used to study surface films of nonconducting substances such as ad- 
sorbed gases. In comparison with other techniques it has the following 
advantages: 

1. It can be used in the presence of the gas. 

2. It can be used over a wide temperature range. 

3. It does not depend primarily on the change in work function, 
which might be small for some adsorbates. 


* Throughout this paper the unit of pressure will be the “‘torr’’, a unit widely 
used in Europe to designate one millimeter of mercury of pressure, i.e., 1.332 X 
10° dynes/em?. (See, for example, W. H. Westphal, Physvkalisches Wdérterbuch, 
Springer Verlag, Berlin, 1952). This avoids the awkward ‘‘millimeters of mercury 

” 


of gas X. 


925 
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1. Under many conditions it does not destroy the surface film. 
5. Adsorption can be studied on the bulk metal, which can be degassed 


by vigorous heating; i.e., dispersed conditions such as powders or 


porous films are not required. 

In general, the contact resistance arises from two effects, the constric- 
tion of the lines of current flow into a relatively small region of contact 
and the resistance offered by a surface film. The pioneering paper in 
this field is due to Holm and Meissner,'® who resorted to low tempera- 
tures to minimize the constriction resistance and showed that, for rea- 
sonably clean metals, there remained a resistance due to the surface 
film of the order of 5 X 10° ohm-em?. Although the method has been 
applied by a number of authors under a wide range of conditions,’ * '° 
only J. J. Went! attempted to obtain clean surfaces under high vacuum 
and then to investigate the adsorption of gases under controlled condi- 
tions. The present investigation is an attempt to extend these measure- 
ments using the improved vacua now available, and to interpret the 
results in terms of the advances in the knowledge of adsorption gained 
since that paper was published. 


Il. PRINCIPLES OF THE METHOD 


For a single circular contact area of diameter, d, much smaller than 
the dimensions of the electrodes, the ‘‘constriction”’ 
resistance is given by (Ref. 2, p. 16): 


or “spreading”’ 


R, = p/d, (1) 


where p is the bulk resistivity of the metal. 
A surface film gives rise to an additional resistance inversely propor- 
tional to the contact area (Ref. 2, p. 79), A: 


R, = 0,/A, 


where oa; is called the “surface resistivity.” 

Although R, is not completely independent of R, because of the re- 
arrangement of the lines of current flow, the change is negligible for our 
purposes (Ref. 2, p. 18), and we may write: 


R=Rk,+R, (3) 


Since R, is proportional to 1/d and R, is proportional to 1/d?, smaller 
diameters of the contact area lead to an increase in the ratio R,/R,. 
Thus, for sufficiently small contact forces and hard metals it is not 
necessary to go to extremely low temperatures to measure F,. In this 





ADSORPTION OF GASES ON METALS 927 


manner Went!! was able to perform some of his experiments on molyb- 
denum at room temperature. 

The theory accounting for the resistance R, for films less than about 
20 Angstroms thick, valid for small potential drops (less than a few 
tenths of a volt), is based on a model in which the adsorbed gas is re- 
garded as simply a spacer holding the metal contacts apart. The elec- 
trons flow by means of quantum mechanical penetration of the potential 
barrier. The result of calculations based on the potential barrier illus- 
trated in Fig. 1, is2- 1%" 


hl tol . 
5 exp j V 2m¢ } statohm-cem 
(] 


ev 2me 


3 xX 10 


"(L/+/o) exp (ly yg) ohm-em’, 

where h, e, and m have their usual significance, / is the thickness of the 
film and ¢ the work function of the metal. The second form of the ex- 
pression is valid for g in volts and / in Angstrom units. 

This resistance is ohmic (i.e., current proportional to voltage) accord- 
ing to both theory and experiment.':? A further refinement involves the 
inclusion of the electron image forces,’ '* and the result of this caleula- 
tion is plotted in Fig. 2. It may be seen from (4) that the exponential 


factor is dominant so long as /+/¢ is greater than one and that, under 


this condition, the resistance is extremely sensitive to / and slightly less 
sensitive to ¢. This is the case for all practically attainable values of / 
and ¢. 

To interpret the data one must have some knowledge of the dimen- 
sions of the contact area. For perfectly smooth electrodes under most 
of the experimental conditions, the contact force could, in principle, be 
supported elastically. Because of the roughness of practical surfaces, 




















Fig. 1 Potential diagram illustrating the model used in the derivation of 
equation (4). 
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Fig. 2 — Contact resistivity, o,; , versus film thickness for various values of the 
work function. The electron image force has been taken into account, modifying 
the potential barrier illustrated in Fig. 1. 


however, the load is in fact supported by squashing a small contact 


region inelastically. Under this condition the contact area can be caleu- 
lated approximately if the hardness of the metal is known: 


‘T4\P ; 
an (ro) H’ _ 


where P is the contact load in grams and H is the hardness on the 
Vickers scale. 
III. EXPERIMENTAL APPARATUS 

The apparatus used in these experiments (Fig. 3) consisted essentially 
of two mechanically polished crossed wires (diameter 15 X 10-* in.) 
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mounted on flat molybdenum springs. The lower of these was connected 
to a quartz yoke which could be pulled up by a micrometer screw operat- 
ing through a sylphon bellows. The force constant of the spring on which 
the upper wire was mounted was measured both, before mounting and 
after dismantling, and the constancy of the force constant as the spring 
was cycled between room and liquid nitrogen temperature was verified 
in a separate experiment. The motion of the upper spring as a function 
of the micrometer setting was calibrated with a cathetometer. 

The tube was mounted in a vacuum system (mercury diffusion pump 
and liquid nitrogen trap) capable of reducing pressure to the order of 
10-"° torr as measured on an ionization gauge. A gas-handling manifold 
was provided and isolated from the system by a Westinghouse-type 


oa 


| S 





~ 











Fig. 3 — The tube in which these experiments were performed 
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vacuum valve.!® The gases were of spectroscopic grade supplied by the 
\ir Reduction Company. The test wires could be flashed by conduction. 

The contact resistance Was measured in the region near 50 milliam 
peres and one millivolt by a four-terminal method using a milliammeter 


and microvoltmeter, exactly as described by Went.!! 


IV. EXPERIMENTAL RESULTS 


Kxperiments were performed on tungsten and molybdenum electrodes 
using nitrogen and oxygen as adsorbates. The behavior of the two metals 
is very similar, and that of the two gases differs only at relatively high 
pressures, The experiments are subdivided as follows: 

1. The behavior of clean metals. 

2. The resistance due to gas adsorbed at room temperature 

3. The temperature of flash necessary to remove the adsorbed film in 
a reasonable time. 

t. Adsorption at lower temperature 

5. The creeping in and out of adsorbed gas between closed contacts. 

One is limited in these experiments by the scatter in the data. In 
run of ten closures at the same force the largest value of the resistance 
Was usually about twice the smallest, and the average deviation from the 
mean about 20 per cent. After the electrodes had been exposed to the 
same conditions of gas pressure for equal intervals at widely different 
times during the course of the experiment even the average of ten re- 
sistance measurements sometimes varied by as much as a factor of two. 
This scatter is partly due to differences in local hardness because of the 
polycrystalline nature of the wires, but also could be caused by differ- 
ences in anneahng, as the wire Was flashed and then work-hardened in 
the contact region. It may also be due to local differences in adsorptive 
properties. In any case, as a Consequence little weight can be attached 
to small differences and only fairly gross effects can be interpreted 
reliably. 

Some of the relevant physical properties of tungsten and molybdenum 


are given in Table I. 


1.1 Clean Surfaces 


The metal surfaces were presumed to be clean after having been 
flushed repeatedly at a temperature of 2000°K for molybdenum, and 
at a somewhat higher temperature for tungsten, in a measured residual 


gas pressure of about 2 X 107'° torr. By measuring the increase in the 


maximum pressure during a flash as a function of the cold interval, it 
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PROPERTIES OF MOLYBDENUM AND TUNGSTEN 
resent W 


Resistivity, p, 300°K Mo 10°-* ohm-emt 
W 5.85 X 10°° ohm-em 


Mo 
W 


Ratio of Resistivits 


Hardness, H, 300°K 


Ratio of Hardness \\ 


* Measured resistivity ratios are for wires after outgassing in high vac 
The hardnesses were measured on large pieces trom different samples. Ir 
case of tungsten, the hardness samples had also been vacuum annealed, b 
molybdenum hardness sample was not so treated 


t Ref. 16, pp. 2357-2363 
t Refs. 17 and IS 


§ Ref. 19, pp. 1140, 1144 


was verified that about five hours were required to rench a constant 
value. The experiments to be reported in this section were performed 
Within a period less than one-half hour after the flash 

For a foree of 1 @m (+5 per cent the contact resistance averaged 
between 10 and 16 milliohms for molybdenum and between 14.5 and 
28 milliohms for tungsten. This range of values is for different samples 


and for measurements taken at different times during the course of 


these experiments. If the values of the hardness and resistivity listed 


in Table I are used in equations (1) and (5), one computes 22 and 36 
milliohms for the two metals. Considering that the load is supported 
elastically to some extent and that the contact region may not be exactly 
circular, this agreement is satisfactory. It is assumed in latter portions 
of this paper that contact areas obtained from the measured resistivities 
under these conditions are more reliable than those computed from the 
hardness. 

The contact resistance between clean metal electrodes, according to 
(1) and (5), should vary as the inverse square root of the force, and it 
does so within the limits of experimental error, as can be seen in one of 
the curves of Fig. 4.* However, to obtain this result it is necessary to 


* For the opposite extreme, when R,> R, rather than R, > R,; , one would ex 
pect / to vary as the inverse first power of the force if the load is supported by 
plastie deformation. The other curve of Fig. 4, obtained for the thickest film ob 
served during the course of these experiments, shows that this is also verified ex 


perimentally 
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Fig. 4 Contact resistance versus contact foree for clean tungsten electrodes 
and for electrodes which have been exposed to 6.5 torr of oxygen for 24 hours. Each 
point is the average for five closures 


increase the force to its final value and not to overshoot and come back. 
The effect of cycling is shown in Fig. 5. There is also a relatively slow 
increase in contact area with time and a typical curve of the resulting 
decrease in resistance is shown in lig. 6. 

The assumption that the load is supported on regions formed, for 
the most part, by plastic flow is thus confirmed by the following obser- 
vations: 

1. The approximate agreement of the measured value of is with that 
computed on this assumption. The contact resistances computed for an 
elastically supported load are considerably smaller than those found 
experimentally. 

2. The slopes in Fig. 4, which are 0.5 clean and 1.0 with a thick film, 


as expected for plastic deformation, rather than 0.8 and 0.7, as would 


be expected for purely elastic deformation. 
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Fig 5 The contact resistance of clean tungsten with increasing 
ing contact force The numbers on the experiment il points give the 


the measurements 


3. The irreversibility exhibited in Fig. 
1. The slow changes shown in Fig. 6 
The second point also shows that any surface film still present o1 


the 


nominally clean surface contributes very little to the measured resist 


nce, 
t lowering the tempera 


The results of attempts to check the effect « 
ture on the contact resistance of clean metals were inconclusive because 
1. The long time (about one-half hour required for the contacts to 


cool down to liquid nitrogen temperature after flashing made the 


acuuMm 


t} 


requirements much more severe. 
Pactual temperature oF the 


2. Inadequate radiation shielding made the 


contacts uncertain 


TO 











The contact resistance between closed clean tungsten contacts (lores 


Fig. 6 
1 gm) as a function of time. The tube was jostled slightly by passing footsteps a 


the time indicated 





O34 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1958 


lor both metals, the contact resistances measured when the contacts 
were closed with the tube immersed in a liquid nitrogen bath were lower 
by a faetor of about four than the contact resistances measured at 
room temperature with the same contact force. The resistivity and hard- 


ness ratios of Table I, however, would lead one to expect a factor of 


about seven.* 

In summary, the results of these experiments are consistent with the 
behavior expected for clean metals plastically deformed by the contact 
load. The contact areas for a force of 1 gm at room temperature are 
about 2 & 10°77 em® for molybdenum and 5 & 10-5 em? for tungsten. 


1.2 The Contact Resistance Due to Gases Adsorbed at Room T¢ mperature 


\ series of experiments in which the contact resistance Was measured 
aiter tungsten contacts were exposed to nitrogen and oxygen at various 
pressures and for various intervals is reported in Appendices A and B 
and summarized in Figs. 11 and 12. The gas was usually admitted by 
opening a metal valve very slightly and allowing the gas to leak in while 
the system was continuously being pumped. Because of the pumping 
action of the ion gauge and the glass walls, which decreased as they 
became saturated, the pressure showed a steady increase with time. 
However, for nitrogen, except at the lowest pressures, the effect was 
small, so that the rate of change in pressure could be ignored in comput- 
ing the pressure-time product, here called the ‘‘exposure’’. For oxygen, 
on the other hand, the effect was quite large and had to be taken into 
account. The corresponding desorption during the interval of pumping 
out the gas resulted in higher pressures after a given time with oxygen 
than with nitrogen. 

To interpret these results, it is helpful to review some of the results 
of measurements obtained by other methods. When nitrogen (and _ per- 
haps oxygen) molecules strike a clean molybdenum or tungsten surface 
at room temperature, the sticking probability, s, is relatively constant 
and of the order of tenths until a coverage of roughly 1.7 X 10" mole- 
cules per cm? is attained, at which point it falls off rather sharply to a 


much lower value.’ ” Thus, after a few seconds at 10~° torr or a few 
minutes at 10 ° torr, this coverage is attained, after which the coverage 
changes relatively slowly. The sticking probability continues to fall off 
with increasing coverage until, after a considerably longer time (of the 
order of 15 minutes for nitrogen at 10° torr, when the coverage is above 
3.5 X 10" molecules per cm?), the sticking probability remains relatively 
constant at a value of roughly 10~-*. When the coverage reaches approxi- 


* Went!! reports a factor of only 2.3 
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mately 5.5 X 10' molecules per cm’, the sticking probability for ni 


trogen falls off to an immeasurably small value.2! The ‘‘exposure’’ for 
the first step of rapid adsorption is, for nitrogen, approximately 3 x 10 

torr-min, and exposure needed for a relatively constant low value of s 
is approximately 107° torr-min. ‘To attain the immeasurably low stick 
ing probability the exposure is roughly 10° torr-min.* 

The results given in Appendix A and Fig. 11 for nitrogen on tungsten 
may be summarized as follows: exposures corresponding to the comple 
tion of the critical period of rapid chemisorption give rise to a surtace 
resistivity of 3.5 X 10°!’ ohm-cm*, while exposures corresponding to 
the attainment of immeasurably low sticking probability give rise to a 
surface resistivity of 12 x 10°? ohm-cem:?. Higher values of the surface 
resistivity observed for much greater exposures may be due to impurities 
In the gas. No appreciable part of the adsorbed laver giving rise to the 
observed values of o, could be removed at room temperature by 
pumping out the nitrogen 

The results given in Appendix B and Fig. 12 for oxygen on tungsten 


vield o, 5 X& 107°? ohm-cem? at the completion of the initial period of 
rapid absorption, and a, > 10-'° ohm-em? for exposures roughly 
ten times as great. However, at pressures above 10~° torr the resistance 
continues to increase at a measurable rate up to the longest times ob 
served (24 hours) and eventually exceeds any of the values observed 
for nitrogen on tungsten. This is attributed to the formation of one of 
the stable oxides of tungsten (perhaps WOs;). For the growth of such 
films a logarithmic law, the ‘“Elovich” equation, has been widely ob 
served,> but no attempt to follow this growth quantitatively will be 
made in the present paper. For the thickest film observed, resulting from 
the exposure to 6 torr of oxvgen for 24 hours (Fig. 4), o; was equal to 
6 X 10°58 ohm-em?. 

When the pressure has been as high as several torr of oxygen, loosely 
held molecules on the oxide film contribute to the contact resistance 
These can, however, be pumped off at room temperature. 

While the data on molybdenum are less extensive, the values of 
for the first and second steps ol adsorption on the two metals are simi 


lar, as is the behavior of oxygen at a pressure of a few torr. 


1.3 7" mn pe rature of Flash Nee ssary to Remove the Adsorbed Filn 


In these experiments, the contacts were exposed to a few torr of the 


gas in question for a few minutes, and the system was then pumped 


* The sticking probability may be a function of pressure at high pressure; 1.e 
processes other than the arrival rate may control the rate of reaction 
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hig. 7 The contact resistance at room temperature (force l gm) after heat 
ing for 30 see with various currents for molybdenum and tungsten contacts whieh 
h ive been exposed to a tew torr of oxygen or nitrogen 


down to a pressure of about 10-8 torr before the flashing procedure* 


Was 
started. 


30 seconds, after which 5 
minutes were allowed for cooling. The results are given in Fig 


The heating current was maintained for 
7, Where 
each point is the average for five closings. 

Flashing at about LOOO°IK causes some lowering of the contact resist- 
ance for oxygen on either metal, whereas for nitrogen there is no ob- 
servable effect below 1450°K on molybdenum, and below a somewhat 
higher temperature on tungsten. For all four combinations of gas and 
metal there is a sharp drop in resistance after flashing at 6.5 amperes 
(1600 + 50°K for molybdenum, 1750 4 
curve is essentially flat above about 
above L800°K for tungsten. 


100°K for tungsten) and the 


1725°K for molybdenum, and 


The molybdenum temperatures were measured with an optical pyrom- 


eter and suitably corrected for the emissivity and reflection in the glass. 


*Even at this lower pressure, & lave corresponding to the fast adsorption 
first step) can be adsorbed in the few minutes necessary for the electrodes to cool 
so that it was not possible to determine whether the metal had been completely 
cleaned, but only whether the thicker lavers had been flashed off 
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Untortunately, the wall of the tube in the tungsten experiment had been 
coated with evaporated metal before such a temperature measurement 
Was taken, so that these temperatures are taken from the tables ot 
Jones and Langmuir,! with an estimated correction for cooling at the 
ends. 

The data for OxVgen on molybdenum are consistent with those of 
Went,!! who finds some reduction in resistance after heating to 1000°K 
for 5 minutes and a more marked drop after heating to 1750°K. A rise 
1h resistance observed by Went at 1400 Ix Was hot observed 1h the pres 
ent experiment. 

lor oxygen on tungsten the results of field emission microscope experi 
ments are roughly consistent with the present experiments hese re 
sults have been interpreted to be due to these successive actions partial 
desorption from some crystal planes in one minute at SOO°K; a tempera 
ture region (SOO to 1400°hK) where there is no further desorption; 
resumption of desorption (perhaps of WO) from all planes in the region 
from 1400 to L700°RK. 

The observation of some relatively loosely bound oxygen 1s 
agreement with Morrison and Roberts,*? who suggest that these mole 
cules are adsorbed on isolated empty sites tor the chemisorption ol an 
atom, Where the molecule is prevented from dissociating by the lack of 
an adjacent empty site 

kor nitrogen on tungsten the present result differs from that of Becker 
and Hartman,?’ in that we do not observe a decrease in contact resist 
ance after 30 seconds at temperatures in the neighborhood of 11L00°K 
even when, in their notation, @& 2 (second step ol adsorption complete 
under which condition they observe a measurable rate of desorption. 
However, there is rough ngreement as to the temperature necessary to 


clean the metal surface. 


14 Adsorption at Lower T mperature 


Went" and de Boer and Kraak?® present evidence that oxygen is only 


physisorbed and may be pumped off when it is admitted to a nominally 


clean molybdenum surface at a temperature near LOO°K. On the other 
hand, Gomer®*° has shown that, even at 40°K, oxygen is rapidly 
chemisorbed on a clean tungsten surface. One is tempted to conelude 
that the vacuum techniques at the time of the experiments of Went and 
de Boer and Kraak were not adequate to maintain a clean surface during 
preparations necessary before oxygen Was admitted (cooling down after 


flashing, in the case of Went, and pre-sintering the evaporated film, in 
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the case of de Boer and Kraak). The reversible adsorption they observe 
may thus apply to surfaces already covered with chemisorbed atoms. 


The arguments presented by Went to show that his surface was essen 


tially clean can be shown to be inconclusive. One possible exception may 


be the high temperature required to recover his “clean” state, but the 
evidence that the surfaces remained clean while cooling down to liquid 
oxygen temperature is unconvineing. 

The present experiments were performed in a liquid nitrogen bath 
for oxygen on tungsten, nitrogen on tungsten and nitrogen on molyb 
denum. The results of these experiments are tabulated in \ppendix C. 
In no case was the initial state of cleanliness recovered by pumpmg at 
the low temperature, In the ease of nitrogen on tungsten, there was an 
increase in resistance during the pumping interval, possibly due to the 
accumulation of impurity gases on the surface. 

The temperature of the contacts in these experiments Was somewhat 
above that of the liquid nitrogen bath, because of inadequate radiation 
shielding. With all exposed parts of the tube packed with dry ice to 
reduce thermal radiation, the resistance of one of the tungsten wires 
was used as a thermometer. Using this procedure, the temperature in 
the above experiments was estimated to be 95 + 10°R. 

In summary, the bulk of the nitrogen and oxygen taken up during 
the initial period of fast adsorption at 95°KK cannot be removed in a 
few hours by merely reducing the gas pressure to a much lower value. 
This is in disagreement with some early results,!'?* but is in agreement 
with more recent experiments. 


1.5 The Creeping in and out of Adsorbed Gas Between Closed Contacts 


Went (Ref. 11, Fig. 10) reports that physisorbed layers of oxygen at 
room temperature can migrate in and out of the region between contacts 
of molybdenum. He presents arguments to show that this is possible. 
In the present experiments the observed effects were smaller by at least 
one order of magnitude. 

After the sequence of measurements of Fig. 6, where the fall in contact 
resistance with time on clean metal is attributed to a slow increase in 
area, 5 torr of oxygen were admitted to the closed tungsten contacts. 
The contact resistance as a function of time is shown in Fig. 8(a). The 
increase in resistance in 100 minutes is small and occurs in a single Jump, 
and so may be due to a jarring of the apparatus, 

If the contacts are closed in the presence of the gas, the resistance 
between the contacts decreases with time |[Fig. 8(b)|. This is not neces- 


sarily due to the squeezing out of gas, as it is of the order to be expected 
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Fig s 1) The resist ince between closed tungsten contacts rorce l gm is 
a funetion of time when 5 torr of oxvgen are admitted at ¢ = 0. This sequence ac 

tually follows that of I ig. 6 While no actual disturbance was noted at the time 
marked ‘Jostled ’ the flatness of the curve for earlier and later times indicates 
that this is very likely the reason for the sudden small increase in resistance during 
this interval b) The resistance bet ween closed tungsten contacts as a function ol 
time after exposure to 6.5 torr of oxvgen. At the point marked ‘‘A’’ on one of the 
curves the gas Was p imped out without disturbing the closed contacts 


because of slowly increasing area, judging from the rate of change of 
the resistance between clean contacts (Fig. 6). If the gas is pumped out 
during such a sequence, a small discontinuity sometimes results [Fig. 
S(b)]| which may be due to gas molecules migrating out of a small region 
near the edge of the contact area. 

A similar sequence for oxygen on molybdenum is shown in Fig. 9 


However, this is for a somewhat lower pressure and smaller time of ex 


posure before closing than that used by Went!" (2.7 torr rather than 10 


torr and two hours rather than ten hours). 


INTERPRETATION AND DISCUSSION 


The experiments on clean metal (Section 4.1) show that the assump- 
tions leading to equations (1), (2), (3) and (5) are approximately valid, 
and give a means for measuring the contact area. 

To interpret the data on films adsorbed at room temperature (Section 
1.2) by means of (4) it is necessary to assign a value to the work fune- 
tion, ¢g. The changes in contact potential during the adsorption of ni- 


trogen or oxygen show that the work function is increased," * but. in 
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ee ne 


Fig. 9 The resistance between closed, initially clean, molybdenum contacts 
force = 1 gm) as a function of time at room temperature. At point A, 2.7 torr of 
oxygen were admitted to the system. Point B is the average resistance for ten 
closings in the presence of the gas, Beginning at point C the contacts were again 
left closed. At point D the gas was pumped out. 


the presence of such an adsorbed film the potential diagram of lig. | 


is an oversimplification and the values of / obtained by the use of (4) 
can have only an approximate meaning. Furthermore, the film may not 
be uniform over the contact area, in which case the measured values are 
averages heavily weighted toward a value appropriate to the thinnest 
film within the contact area. The values of / according to (4), using the 
data of Section 4.2, are given in Table Il. Here, ¢ 5.0 volts for ni- 
trogen on tungsten; ¢ 6.0 volts for oxygen on tungsten. 

For the thickness of the layer on a single surface these values of 1 
should be approximately halved. Thus, the entire range of thickness of 
the layers on each surface in these experiments is from 1.5 to 2.6 Ang- 
stroms, i.e., one or two atom diameters in thickness. The interpretation 
of this distance on an atomic model, however, offers considerable diffi- 


TABLE II THICKNESS OF ABSORBED FILM ON TUNGSTEN 


Surface Resistivity Due to Film, e¢, Thickness of Film, / 


ohm-cm Angstrom Units 
age, Molecules, « 


Oxygen Nitrogen Oxygen 


L.¢ xX 1 10 5 X 10°" 3.0 
3.4 X 10* : Lx iv 4.0 
More than 4 X 10! ‘ up to6 X 10°8 i 


o” 
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culty, as is illustrated diagrammatically in Fig. 10. This difficulty is 


connected with the problem of where to place the potential barrier (as 
illustrated in Fig. 1) with respect. to the positions ol the atoms in the 
surface plane®. 

In Section 4.3 the range of temperatures necessary to remove these 
films from the surface is found to be in general agreement with that 
determined by other techniques. It is perhaps surprising how similar 
the behaviors of nitrogen on tungsten, oxygen on tungsten, hitrogen on 
molybdenum and oxygen on molybdenum are to one another: All require 
a temperature in the vicinity of 1700°K to remove a large fraction of 
the adsorbed gas within about 30 seconds. The heat of adsorption of the 
desorbing molecules is thus of the order of five electron volts. 

In Section 4.2 it was shown that some of the oxygen giving rise to the 
measured resistance after relatively great exposures ¢c vuld be p imped 
off at room temperature. This was not the case for nitrogen. The result 
reported in Section 4.3——that temperatures sufficient to remove a 
considerable amount ol oxygen had no observable effect on the nitrogen 
layers —confirms the conclusion that additional oxygen is bound, 
though with relatively weak bonds, on top of the more firmly held lay 
ers.267.54 Nitrogen, on the other hand, shows little tendency to stick 


to the completed nitrogen layers at room temperature, although some 


Fig. 10 \ conceivable arrangement of atoms when two (100) faces of a body 
centered cubie lattice are opposed, with chemisorbed gas atoms on each surface in 
one quarter of the lattice sites. If the coverage were twice as great, the opposing 
lattice planes would not necessarily be farther apart the alternate empty posi 
tions could simply be filled in. The relative radii are approximately those appro 
priate to oxygen on tungsten The distance, z, between the nearest opposed 100 
planes is 1.75 times that in the normal lattice, H, i.e., about 2.8 A. The distance 
l to be used in (4) should perhaps lie between this value and X H, the increase 


over the normal distance, which is about 1.2 A 
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of the gas adsorbed at liquid nitrogen temperature is released as the 
contacts are allowed to warm up (Appendix C, Table VII). 

The differences between the results of Section 4.4 on adsorption at 
95°K and those reported by Went!" are unexplained. 

In Section 4.5 the effects of migration of adsorbed molecules between 
closed contacts are found to be much smaller than those observed by 
Went (Ref. 11, Fig. 10). This might be due to low-frequency mechanical 
vibration in his experiments resulting from footsteps ete., which may 
have rocked the contacts, allowing different regions to be exposed.” Ex 
tensive precautions against vibrations were necessary to obtain the 
results presented in Figs. 6, 8 and 9 
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APPENDIX A 


We interpret the values of o, for the entries in Table II] numbered 2 
through 6 and 12 and 13, for exposures between 7 K 10-5 and 6 X 10 


torr-min as due to a layer corresponding approximately to the completion 


of the first step of rapid chemisorption. The average value of o, is 
3.5 & 10°” ohm-em? (+30 per cent). Entries numbered 7, 8, 15 through 
21 and 23 through 27, for exposures between 1.7 XK 10-4 and 8 X 107! 
torr-min, we attribute to a layer somewhere in the second or third steps 
of slower chemisorption. The value of o, averages 1.2 & 107° ohm-cm? 
(+25 per cent). 

The higher values of o; in entries 28 through 34 corresponding to ex- 
posures from 1.0 to 85 torr-min might possibly be attributed to an im- 
purity in the gas. Oxygen, if present in a few parts in 10°, could account 
for such an effect, while spectroscopically unobservable impurities might 
be present in concentrations of the order of a few parts in 10‘. The high- 
est value of a; observed in the presence of nominally pure nitrogen is 
~5 X 10-° ohm-cm’. 

Entries 9, 10, 20, 21, 33 and 34 show that none of the adsorbed gas 





“ABLE IT] THE ADSORPTION OF NITROGEN ON TUNGSTEN AT Roo 


TEMPERATURE (Fig. 11 


1. Flash repeatedly; allow to cool for 7 assumed 
10 min in- residual pressure 
Li x WwW torr 
Admit 7 X 10°" torr of No: wait 10 
min 
Wait 35 min 
Wait 35 min 
Admit 2 X 10 7 torr of N» ; wait 5 
min 
Wait 50 min 
Admit 8S X 10 torr of N -wait 5 
min 
Wait 30 min 
Close leak from No source: pump 
for 5 min pressure down to 
5 xX 10 torr 
Pump for 16 hours, pressure down 2X 10 
toS X 10 °* torr 
Flash repeatedly, pressure falls to 7. assume 
1X 10°°; wait 10 min 
Admit 4.5 X 10°-* torrof No ; wait 3 2x 10 
min 
Wait 10 min i 5X 10 
Wait 10 min I 7x 10 
Wait 15 min 33 1 xX 10 
Wait 10 min 3 2% 10 
Wait 15 min 33 1x 10 
Wait 40 min 35 6 xX 10 
Wait 50 min 3 - > 
Close leak from source; pump for 5 3 1x 10 
min, pressure down tol X 10-7 tort 
Pump for 16 hours, pressure down 3 = x 10 
to3 X 10°" torr 
Flash repeatedly, residual pres : , assume 
sure down tol X 10°° torr; wait 10 
min 
Admit 4 X 10 * torr of No ; wait 5 6 xX 10 
min 
Wait one hou 
Wait one hour 
Admit 1.4 X 10°? torrof No ; wait 5 
min 
Admit 3 X 10°? torr of Ns ; wait 5 
min 
Admit 1.5 X 10° 'torrof No ; wait: 
min 
Wait 30 min 
Admit 9 X 10°' torr of No ; wait 5 
min 
Admit 3.3 torr of Ne» ; wait 3 min 
Wait 20 min 
Open valve to pumps; in 5 min, 
pressure down to 3 X 1077 torr 
Wait 30 min, pressure down to 
2.5 X 10°* tor 


10 


10 


10 


AI OE 
Kx MRK KK K K KKK 
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TABLE IV THe ADSORPTION OF OXYGEN ON TUNGSTEN AT Room 


TEMPERATURE (Tig, 12) 


Treatment 


Flash repeatedly in residual 0, assumed 
pressure 2 X 10°!'° torr; wait 
10 min 

Flash in presence of 1 X 10 
torrof O».; wait 10 min, during 
which pressure rises to 2.5 X 
10-° torr 

Wait 20 min, during which 
pressure rises to 3.5 x 10 
torr of Os» 

Wait 30 min, pressure up to 
6.5 xX 10°° torr 

Wait 2 hours, pressure up to 
2.4 X 10°? torr 

Reduce pumping speed pres 
sure rises to 2 X 10°° torr; 
wait 5 min 

Wait 1 hour, pressure up to 
5 X 10°* torr 

Wait 20 min, pressure up to 
6 X 10°° torr 

Close valve to Oo source; in 
crease pump speed; wait 10 
min, pressure down to5 X 1075 
torr 

Pump for 80 min, pressure 
down to 1.5 KX 10-8 torr 

Pump for 16 hours, pressure 
down to 2X 10°* torr 

Flash repeatedly, pressure 
falls to 1.3 X& 10°° torr; wait 
10 min 

Admit O» to closed system at 
~2 X 10°° torr; wait 30 min, 
pressure falls to 5 X 10°° tort 
Wait two hours, pressure 
down to 1.8 X 10°* torr 

Wait 30 min, pressure down to 
1.6 X 10°° torr 

Admit O2 to pressure 9 X 10°! 
torr; wait 10 min 

Admit O» to pressure 3 X 10 
torr; wait 5 min 

Wait 30 min, pressure 4 X 10 
torr 

Wait 40 min, pressure 5 X 10 
torr 

Wait one hour, pressure 5 X 
10-5 torr 

Increase pressure to 3 X 10 
torr, wait 15 min 


x 


ak eR KR RM ROM Mm 
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TABLI lV Continued 


Close O» leak; increase pump 
ing speed; wait 7 min, pres 
sure down to 2.5 X 10°° torr 
Wait 25 min, pressure 1.2 X 
10 
Pump for 16 hours pressure 
be &-20* tort 
Admit O» to pressure 1.4 torr; 
wait 20 min 
Wait one hour 
Increase pressure to 2 
walt 380 min 
Wait 2 hours 
Open valve to pumps; wait 
one hour, pressure 5 X 10 
torr 
Pump for 1's hours, pressure 
2.5 X 10°? torr 
Pump for 16 hours pressure 
7 X 10-8 torr 
Admit 5.5 torr of O wait 2 354 ye 1.000 
hours 
Wait 30 min 110 

2 unusually 

large readings 

not included in 

average ~ 1400 
Open valve to pump; wait 2 319 
hours, pressure down to 1 X 
10-7 tor 


giving rise to the measured resistance pumps off at room temperature 


even after the nitrogen pressure has been as high as 3 tort 


APPENDIX B 


We attribute entries in Table IV numbered 3 through 6 for exposures 
from + & 10-7 to 3 X& 10° torr-min to the first step of tast adsorption, 
with of > 5 X 107" ohm-em*. Entries 7 through 11, 14 and 15, for ex 
posures from 2 X 10°* to about 10° torr-min, are attributed to the com 
pletion of the second step, with a, > 1.0 X 10-° ohm-cm2. For pressures 
of oxygen above ~10~° mm of mercury, the resistance continues to 
increase at an observable rate (entries 16 through 33 


Entries 28 through 31, 33 and 34 show that, after exposure to a few 
| 


ons 


torr, a considerable decrease in resistance occurs on pumping out the 
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APPENDIX ¢ 


Ldsorplion and De sorption at Low T moperatures 


TABLE V OXYGEN ON TUNGSTEN 


Contact 
Resistance 
Milliohms 


Iixpose to 10-° torr of O» at room temperature for a few minutes: 
measure resistance with force = 1 gm 

Close contacts with force = 2 gm 

Apply liquid N» and wait 30 min with contacts remaining closed 
Increase O» pressure to 10°° torr; allow contacts to stand open 15 
min; measure resistance with force 1 gm 140 
Wait 2 hours 160 
Open valve to pumps, wait 1 hour until pressure is 107-7 torr; measure 


resistance with foree = 1 gm 160 
Remove liquid No; allow system to warm 10 minutes, pressure 
2 & 10°? torr (force = 1 gm 120 
Pump 16 hours at room temperature, pressure 10 * torr (force 
] 
i 


%) 


gm 


TABLE VI NITROGEN ON MOLYBDENUM 


Flash contacts in liquid Ne bath; allow 30 min to cool, pressure 
7X 10°! torr 

\dmit Ne, pressure = 5 X 10°° torr; wait 10 min 

Open valve to pumps; wait 1 hour, pressure = 1 X 107° tor 

Admit nitrogen, pressure 6.0 torr; wait 10 min 

Open valve to pumps; wait 10 min, pressure 10° ° torr 


TaBLE VII — NirrRoGEN ON TUNGSTEN 


Contact 

Resistance 

Milliohms 
Force 1 gm 


Flash in liquid Ne bath; wait 30 min, pressure 2 X 10°!" torr Ss 
Admit 1.0 torr of Ne ; wait 40 min.. 85 
Open valve to pumps; wait 5min, pressure down to 3 X 10°° torr 124 
Wait 1 hour, pressure 3 X 107 tor: 150 
5. Remove liquid N2; wait 15 min to allow system to come to room tem 
perature, pressure = 107° torr 60 
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INTRODUCTION 


The shot noise output of any p-n junction device in any circuit can be 
calculated by enumerating the possible hole and electron motions and 
summing the noise energies delivered to the load. This method of analy 
sis will be illustrated by applying it to the de-biased p-n junction (See 
tion IX). A more complicated example, and the principle topic of this 
paper, is the general p-n Junction frequency converter (Section X] 

The technological significance of two kinds of semiconductor diodes 
used as frequency converters will be discussed briefly. Seetion II will 
explain how this theory of shot noise stimulated the recent development 
of low-noise microwave amplifiers using the nonlinear capacitance of 
p-n junctions. Section IT] mentions the nonlinear resistor, for which shot 
noise appears to be a vital problem. 

The “particle-method” formulation of p-n junction theory in terms 
of the statistics of individual carrier motions (Section VI) is an alterna 
tive to the diffusion equations ordinarily used in the analysis of p-n june 
tions. Either approach leads directly to the impedance of the junction 

* Supported in part by the U.S. Army Signal Corps under contract DA 36-039 
sc-5589. Preliminary reports of this work were given at the Microwave Crystal 


Rectifier Symposium, Fort Monmouth, N. J., February 1956, and the I.R.E 
l 


A .L.E.E. Semiconductor Device Research Conference, Boulder, Colo., July Q57 
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(Sections VIT and VIII) and to the small-signal transmission properties 


of frequency converters (Section X). However, the particle method is 
the most direct Way to calculate shot noise. 

After the general equations for noise figure are derived in Section XI, 
the consequences for nonlinear capacitors and nonlinear resistors are dis- 
cussed in Sections XII and XIII. One of the virtues of the analysis is 
the possibility of relating impedances and noise without regard to the 
structure of the p-n junction. However, a few examples of nonlinear 
capacitor structures are given in Section XII. 

As explained in Sections IV and V, the particle method and the diffu 
sion equation method deal with just one aspect of p-n junction phe- 
nomena: minority-carrier effeets, which have varying importance, de 
pending upon the device and its use. Minority carrier effects are nowhere 
more important than in p-n junction transistors. Thus, the particle 
method is an excellent way to calculate the shot noise in transistors, 


although such a ealeulation will not be riven in this article.* 


Il. NONLINEAR-CAPACITANCE AMPLIFIERS 


\ time-varying capacitor is an active device that can be used as a 
linear amplifier of small signals, usually, but not necessarily, with a shift 
In signal Irequency. The power required for the amplification is supplied 
by the motor, of whatever kind, that varies the capacitance. This prin- 
ciple is used in vibrating-reed electrometers for amplifying very low 
frequencies. 

To obtain microwave amplification, one needs a capacitance that 
varies at microwave or higher frequencies. Such a time-varying capaci- 
tance can be obtained by applying a microwave voltage to the voltage- 
dependent capacitance of a p-n junction diode. Remarkably general gain 
and power relations can be derived for circuits of this type. 

Amplifying diode-frequency converters are not new. During World 
War II, welded-contact germanium diodes were found to give conver- 
sion gain when used in superheterodyne receivers of the type then under- 
going extensive development for radar use.* With the then-existing tech- 
nology, it was found that these diodes were not as good as point-contact 
silicon diodes in the over-all system noise figure that could be obtained. 
As a result of this observation, point-contact silicon diodes took over the 
entire radar field until recently, and the manufacture of welded-contact 
diodes was discontinued. 

*For such a calculation see van der Ziel, A. and Becking, A. G. J., Theory of 


Junction Diode and Junction Transistor Noise, Proc. I.R.E. 46, March 1958, pp 
589-594 
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These results, though entirely empirical, may very well have led to a 
belief that amplification by diodes was a freak effeet and necessarily 
noisy. In the years following World War II, casual experiments on diode 
sunplification were performed in many laboratories, but low noise was 
not found, nor was it intensively sought. The amplification effect was 
put to practical use in transmitting modulators, where noise would not 
be important.° 

At this point, the shot-noise theory of p-n junction frequency con 
verters was developed. It pro\ ided one logically complete example ol 
how a diode could amplify with arbitrarily low noise. It was recognized 
that the minority-carrier mechanism assumed in the analysis was prob 
ably not as important, at high frequencies, as the depletion-laver ca 
pacitance; at the same time, the understanding acquired from the shot 
noise theory suggested that the depletion-layer capacitance would also 
be noiseless. 

Subsequent efforts to make improved nonlinear capacitance diodes‘ 
and to give them a fair circuit evaluation have been extremely well re 
warded. A diffused-junction silicon diode has given 9 db of gain with a 
2-db noise figure in converting 460 me to 9375 me.? Also, such a diode 
has given 35 db of negative-resistance gain at 6000 me with a 3-db 
noise figure (for double sideband input 

In view of the large variety ot possible lrequencey relations, to Say 


nothing of the circuits involving combinations of diodes, it is fortunate 


that it can be deduced that an ideal nonlinear capacitor is noiseless in any 


circuit. By p-n junction techniques it is possible to make nonlinear ca 
pacitors that are practically ideal, except for a series resistance. Work 
ing theories of nonlinear capacitors will therefore be concerned with the 


noise from this series resistance.’ 


Ill. NONLINEAR RESISTORS 


Point-contact crystal rectifiers are in widespread practical use in 
microwave superheterodyne receivers. It has long been believed that the 
useful action of these devices results from nonlinear resistance and, as 
yet, there is no reason to doubt that this is true. 

As far as noise is concerned, there is no difference between the mathe 
matical results for the p-n junction type of nonlinear resistance and the 
results that would have been obtained from earlier ideas of metal-semi 
conductor contacts. But the older theory was not carried very far for the 
calculation of noise figure. Therefore, it seems worthwhile to point out 


the importance of the local-oseillator waveform 
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In the past, most analyses have been made on the assumption of 
short-circuited harmonics. This rather arbitrary postulation of a wave 
form is not realizable in practice, but it does give rise to some simple 
relations between conversion loss and noise figure. Another case, open- 
circuited harmonics, will be found to have an entirely different relation 
between conversion loss and noise figure. Thus, the analysis of nonlinear 
resistors will show, by example, that no general relation between gain 
and noise figure is true. 

In contrast to the nonlinear resistor, a nonlinear capacitor gives low 


noise for any local-oscillator waveform. 


IV. p-h JUNCTION THEORY 


\s shown in Fig. 1, there are two important methods of analyzing p-n 
junctions, each complementing the other. The pseudo-equilibrium 
method is an accurate means of calculating the capacitance-voltage 
characteristic of a p-n Junction if one knows, from other considerations, 
that it is 2 nonlinear capacitor. However, this method gives no informa 
tion about Irequency dependence and noise, 

The kinetic method begins with an approximate separation of physical 
mechanisms, as discussed in Section V. This separation makes it rela- 
tively easy to calculate the frequency-dependent admittance and noise. 
The kinetic method can be applied not only to nonlinear resistors and 


nonlinear capacitors but also to the host of intermediate types of p-n 
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Fig. | Structure of p-n junction theory involved in or related to the caleula 
tion of noise figure. The encircled elements constitute the ‘classical’? kinetic 
theory of p-n Junctions 
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junctions (including the so-called ‘simple’ p-n junction between homo 
geneous semi-infinite p and n regions 

The usual mathematical approach in the kinetic method involves 
solving the diffusion equation, a differential equation for the average 
motion of carriers. This approach, which will be referred to as “classical” 
p-n Junction theory, is satisfactory for calculating Junction admittance 
In calculating noise, it seems to be much easier first to calculate the noise 
due to individual carrier motions and then to sum them. This second 
procedure will be referred to as ‘the particle method,” since the indi 
viduality of the carriers is preserved to a later stuge in the calculation. 

The particle method gives relations between small-signal junetion ad 
mittances and the noise figure of frequency converters. The train of 
reasoning to be used is indicated by the arrows in Fig. 1. However, the 
connecting lines in Fig. | may be followed in any direction. For exam 
ple, the small-signal admittances may be caleulated from the diffusion 
equation or measured experimentally and used to calculate gain and 


hoise figure 


APPROXIMATE SEPARATION OF PHYSICAL MECHANISMS 


The equivalent circuit of Fig. 2 denotes the principal physical mecha- 
nisms of p-n junction action. The p-n junction is considered to be di- 
vided into essentially neutral p and n regions and an intervening space 
charge region Known as the depletion layer 

The capacitance of the depletion liver may be analyzed by pseudo- 
equilibrium methods under the approximation that the depletion layer 
is free of carriers.’ This capacitance varies with voltage, and amplifying 
frequency converters can be built that rely on it alone. It is reasonably 
certain that the depletion-layer capacitance will be very important in all 
high-frequency nonlinear-capacitance diodes. Nevertheless, we will ig- 
nore the depletion-layver capacitance entirely in the following theory of 


gain and shot noise, except for qualitative remarks. This theory will be 


developed around the motion of carriers across the depletion layer. 


The theory of noise thus obtained will illustrate the fact that at least 
one conceptual type of nonlinear capacitance amplifier is noiseless. It 
will then perhaps be easy to believe that the depletion-layer capacitance 
also should not be a source of noise. The analysis should be dire tly ap 
plicable to practical nonlinear resistors, for these must have relatively 
small values of depletion-layer capacitance for satisfactory operation 


The inescapable importance of the depletion-layer capacitance is con 
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Fig. 2 Nquivalent circuit of p-n junction. In general, the nonlinear resistors 
and capacitors may be frequency-dependent. 





























Fig. 3 — Simplified equivalent circuit in which constant parasitic elements are 
issumed and hole and electron admittances are combined 
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nected with the fact that there is series resistance Rs between the june- 
tion and the metallic contacts, as indicated in Fig. 2. For many diodes, 
the series resistance is found to be reasonably constant as bias is varied, 
giving the equivalent circuit in Fig. 3. This observation is consistent with 
(but does not prove the conjecture that the series resistance exhibits 
thermal noise. If this conjecture is true, the noise figure will depend upon 
the frequencies involved and upon the product RsCmin, where C Is 
the value of the depletion lever capacitance at the maximum reverse 
voltage. But the immediate problem is to caleulat® ‘shot noise due to 
minority carriers (i.e., due to carriers that cross the space-charge region 

and Rs and the depletion laver capacitance will both be neglected. That 
is, the analysis will be carried out for the equivalent circuit of Fig. 4 
It should be noted that this neglect corresponds to the usual first ap 


proximation made in analyzing Junction transistors 


VI. STATISTICAL DESCRIPTION OF MINORITY-CARRIER MOTION 


Each minority CAITiel 2 hole in an n region or an electron in a p 
region can be thought of as an individual. A hole may begin its exist 
ence as a minority carrier by surmounting the barrier potential of the 
depletion layer and entering the n region, or by being spontaneously 
(thermally ) venerated nh the h region Its existence may end by recom, 
bination in the n region or by return to the p region. The situation for 
electrons is analogous 

One need consider only the holes and electrons that cross the barrier 
The life-histories of these carriers, which are the elementary events of 
the theory, fall into the three classes shown in Fig. 5. Their meandering 
paths indicate that the motion of a carrier is governed by statistical 


rules. These rules depend upon the detailed structure of the Junction, 


but it is in keeping with classical p-n Junction theory to assume that they 


do not depend upon the bias applied to the Junction 


























Equivalent cireuit of p-n Junction with negligible parasitics 
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Of the three classes of elementary events, the reverse carriers will, 


throughout the analysis, be the simplest to deal with because they are 


assumed to occur entirely at random. Holes thermally generated in the 
neutral region of a p-n junction move about according to statistical 
rules, and some of them come in contact with the depletion layer. The 
electric field in the depletion layer is in such a direction as to move these 
holes across to the p region (forward bias decreases this field but ordi- 
narily does not change its direction). Similarly, some of the electrons 
generated in the p region cross to the n region. These “saturation cur- 
rents’”’ are represented by current generators 7, and 7, In Fig. 2. They 
are direct-current generators and do not enter physically in the ac ad- 
mittance of the junction, although their values can be introduced, by 
mathematical manipulation, into expressions for the ac admittance. 
But the saturation currents do produce shot noise. The sum of 7, and 
isn IS the saturation current, which may be determined experimentally 
by measuring the current at a moderate reverse voltage. 

Of the many holes in the p region, a few will overcome the potential 
barrier of the space-charge region and be emitted into the n region. The 
rate of emission will be assumed to be a nonlinear function of the po- 
tential across the space-charge region. This function will be written 
R(v), where v is the deviation (in the forward direction) from the equi- 
librium potential. ‘Classical’ p-n junction theory, to be considered in 
Section VIII, involves assumptions equivalent to taking 














Fig. 5 The three classes of hole and electron motions (‘‘elementary events’’) 
(1) reverse carrier from spontaneous generation; (2) forward carrier — emission 
followed by recombinations; (3) forward-and-back carrier — emission and return 
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R(a constant X B= q kT, 


where qg is the electronic charge, /& is Boltzman’s constant, and 7" is 
absolute temperature. The constant of proportionality is not specified. 
The conversion and noise analyses will be carried out with an arbitrary 
function R(v), to accommodate refinements in p-n Junction theory that 
may specify an emission relation differing from (1 

The probability of an emitted carrier nof returning will be denoted 
Py. Carriers with such a fate lead to a nonlinear conductance in the 
equivalent circuit. The probability of an emitted carrier returning at 
time ¢ after emission will be P((). Forward-and-back carriers result, 
In general, In a nonlinear conductance and a nonlinear capacitance whose 
values and frequency dependence can be calculated from P(t’). Since an 
emitted carrier must either return or not return, 

Po+] Pwtyat 

It is consistent with the usual approximations of p-n Junction analysis 
to assume that 7?) is a constant and that P(/’) depends only upon 
These assumptions will permit the noise analysis to be carried out rather 
simply. However, the effect of high current densities upon Py) and P(t’ 
may be an important feature of point-contact diodes. 

All that has been said about the emission of holes from the p side 
applies to the emission of electrons from the n side. Of course, R(v), P 
and P(t’) may be quite different for the electrons. In general, one must 
perform analyses for the holes and electrons and then add the currents 
However, if (1) holds, ® for electrons is proportional to ® for holes, for 


all values of v. Then one can define suitable ®, Py and P(t’) to apply 


to holes and electrons taken together. To simplify the presentation, it 
will be assumed that this has been done. 

In assuming a certain probability for recombination of an emitted 
carrier, we are, in effect, assuming linear recombination. The assump- 
tion that the probability of return depends only on the time after emis- 
sion is equivalent to the assumption of a linear diffusion equation in- 
cluding, possibly, drift in a steady electric field. These are, of course, 
usual assumptions which are made in analyzing p-n junction devices. 
It is not necessarily assumed that the recombination occurs uniformly 
throughout the volume. Surface recombination or recombination at 
localized centers is consistent with the assumptions as long as the re- 
combination is linear. Also, we are not restricted to simple diffusion; 
drift in a steady electric field is permitted. 

The currents for the three types of carrier events are illustrated in 





QO0 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 195S 


lig. 6. The reverse carrier gives a single negative pulse with an area 
equal to one electronic charge; the forward carrier gIVeS 2 positive pulse 
with an area of one electronic charge. The detailed shape of the pulses 
might differ from one to the next and, furthermore, the average shape 
of the forward pulses might be somewhat different from the average 
shape of the reverse pulses, However, it will be assumed in most of the 
analysis that the entire pulse is of such short duration that the exact 
shape is of no consequence at the frequencies of interest. The third type 
of event — forward emission followed by the return of the same car 
rier — leads to a current whose integral is zero net charge. The forward 
pulse part of this single event should be similar to the forward pulse of a 
carrier that is emitted and does not return. Likewise, the reverse pulse 
part should be similar to the reverse pulses of the thermally generated 
carriers. However, the fact that the two pulses correspond to a single 
event is statistically important. 











Fig. 6 Current pulses for the three types of minority carrier events 
VII. SMALL-SIGNAL ADMITTANCE 


The small-signal admittance of the p-n junction will be calculated from 


VK, Po and Pr}. The calculation introduces some relationships needed in 


the main analysis. To avoid the introduction of a large number of symbols, 
a time function and its Fourier coefficients or spectrum will be denoted 
by the same symbol, for example, 7(¢) and 7(w); P(w) is an exception 
with a special definition to be given in (10). 

In general, the current is equal to the difference between the current 
of forward carriers and the current of reverse carriers. lor instantaneous 
forward and reverse transits, this means 


x 


i(t) -i, + galo(t)| — q¢ dt alot —- t)|PC’). 


v0 


The spectrum of 7 may be found from 
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This normalization of the spectrum will be adhered to in the calculation 
of noise. The small signal admittance also, the conversion matrix dis 
cussed in Section X) is a ratio of current to voltage, so any consistent 


normalization may be used. Consider, then a voltage of the form 
ve ) Re: 


where r(w) is the complex amplitude of a vanishingly small ae voltage 


of angular frequency w. The expected solution is 
(et 
Mw V(w v 


The problem is to find ¥(@) and 7% . 


Because p(w) is vanishingly small, 
Riv(t)] R(vo) + RK’ te (we? 8 


Inserting this expression in (3), and performing the integration indicated 
in (4), one finds that 


q Rn’ 
where 


Also, 


after (2) is used. 


VIII, CLASSICAL p-lh JUNCTION ANALYSIS 


The analysis given thus far will be developed later into ways of re 
lating conversion gain and noise to the small-signal admittances of a 
junction. Classical p-n junction analysis provides a way of predicting 
the small-signal admittances of postulated structures, although it does 
not give unique values for R(v), Po and P(t’). Instead, it gives the prod- 
ucts R(v)Po and R(v) P(t), which suffice for the desired calculations. 

The basie assumption of the classical theory is that the hole concen- 


tration p; on the n-side of the depletion layer is given by” 


—r Dre 12 


where py is the equilibrium value of py. Equation (12) is a boundary 


condition for the diffusive flow of holes in the neutral n-region. The 
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generalized diffusion equation (including recombination and drift) is 


Op p — Dy 


: + DV'p — uV-(pE). (13) 
ot T 


The hole concentration p is a function of time and of position within 
the neutral n region; D, w and 7 are diffusion constant, mobility and life- 
time for holes in the n region. The equilibrium hole concentration py 
will vary with position in the n region if the donor concentration is not 
uniform. For the formal analysis, 7 and the electric field E will be as 
sumed to be independent of p and ¢, so that (13) is 2 linear partial differ- 
ential equation. 

The hole current density can be calculated from 


J —qDVp 4 qupE. (14 


If the voltage, v, is composed of a number of sinusoidal components’ 
the hole concentration will be composed of sinusoidal components. Be 
cause (12) is nonlinear, the set of frequencies involved in p; will not, in 
general, be the same as the set of frequencies in v. Even so, the relation 
between p; and J is linear and may conveniently be expressed in the form 

AqD p(w) . 
r(w) (lo) 
DL. 
where A is the junction area and L, is the ‘effective ac diffusion length” 
for the structure in question. In other words, L, is simply a quantity 


that makes (6) and (15) give the right answer for 7(f). This right answer 


is the current obtained by solving (13), applying (14) and integrating 
over the Junction area; this procedure will be used in Section XII in the 
calculation of ZL, for a particular nonlinear-capacitor structure. 

The small-signal hole admittance can be found by considering in (12) a 
voltage of the form given in (5). Then 


BAgDpyer’® 
L 


Y(w) (16) 
The electron admittance is calculated similarly, usually with a quite 
different value of L,, . 

A comparison with the statistical description can now be made using 
(9) and (16). One finds 


BA Dpye”' “4 


®’(v)P(w) 
Le 
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ADpye* 
L 


R(1) P(w) 18) 
since equals 38%, from (1), which is the form of ® that must be as 
sumed to effect a comparison of the particle theory with the classical 


theory. Also, if (1) is assumed, (9) can be written 
V(w qo Kn P(w ° 19 


In the analysis of conversion and noise, it will be found that all im 
portant quantities can be expressed in products of RP(w) or R'P(w). 
Classical p-n junction theory therefore gives a complete solution of the 
problem. Separately defined values of @ and P(w) are not needed be- 
cause a carrier that goes across and then comes back in an infinitesimal 
time might Just as well not have been emitted in the first place. 

The noise analysis will be carried through for an arbitrary emission 
function, R(v), in case the classical theory is not a satisfactory one tor 


some situations. 


IX. SHOT NOISE FOR dc BIAS 


The application of the above principles to the calculation of shot noise 
in de-biased p-n junctions has been published." For this simple case, 
many of the steps in the calculation combined so obviously that explicit 
mention of each step was not necessary. Here, a step-by-step derivation 


of the de results will be given as a model for more complicated cases. 


The ealeulation of shot noise will be based on the following general 


principle: Each elementary event delivers a certain amount of noise 
energy to the load. The noise power is the rate at which noise energy is 
delivered to the load. 

At a given terminal pair of a linear circuit, the noise can be repre 
sented by it mean-square current generator (2°), since the noise power 
delivered to a load connected to the terminals can readily be calculated 
from such a generator, the load admittance and the output admittance 
of the terminals. Representation by a current generator seems particu- 
larly appropriate for shot noise, where the elementary events are repre 
sented by current generators. 

In p-n junctions, the elementary events are the three classes of mi- 
nority-carrier histories: reverse, forward and forward-and-back. Assoei- 
ated with each carrier event is a small signal current generator 9(7), as 
illustrated in Fig. 6, with Fourier transform 9(w) 

Suppose that ¢(w) is observed for an infinitesimally small frequency 
band B. Then the transform back to the time domain of this selected 





QO4 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1958 


element of the spectrum is a current generator 


S(t) | dwd(w)e" 4 complex conjugate, (20) 


/R 


where the indicated integration is over the values of w that lie within the 
selected band. 
If one writes 
G(t i+ A*, (21 
then #°(f), which is proportional to the instantaneous input power, is 
g(t 2AA* + A* + (A*) 
Integration over total time (to obtain a quantity proportional 


energy) can be shown to drop out the latter terms, so that 
| s(t) dt =2 | AA* dt. 


Now, AA* can be written as the double integral 


AA* = | dw | 


/R 


Integrating with respect to time gives 


| AA* dt | da’s(w)5*(w’)[208(w 


| so) at Sar B ! d(w) |. (26) 


If the carrier transits are instantaneous, a forward or reverse transit at 
time fy) gives a current generator 


t g(t to), (27) 


where the impulse function 6(¢) has negligible width in time, but its 
integral over time is unity. 


The spectrum of (27) is 
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lor either a forward or a reverse transit, (26) becomes 
S(t) dt = 2¢°B. 29) 


The development thus far could be used only to treat the nonlinear- 
resistor type of p-n Junction. For the general p-n junction, forward-and- 
back events must be considered. A forward pulse at f , followed by a 


reverse pulse at to + 0, gives a current generator 


qa(t — to) — galt 


For this case, 


, 


| #(t) dt = 2¢°B(2 — 2 cos wl 


The shot noise for each class of carrier event is found by multiplying 
the appropriate integral in (32) by the rate of occurrence of such carrier 
events. Thus, 

> yf » > 

«(] b 2qbi 
q 

24 BR») P 


27 BR) | (2 


2¢ BR(w»)[P(w) 4 
The total shot noise is the sum of (33), (34 
2gBhi tr YG KR(r [P(w) + P(w)* Pol}. 36 


lor the important special case of classical p-n junction theory, (36 
can be transformed through use of (11) and (19) into 


tkT BG(w) — 2qioB, 


where G(w) is the real part of V(w#). This result is easy to remember be 
cause it has the form ‘thermal noise minus shot noise’. However, it is 
evident from the derivation that no subtraction of one noise from another 
is actually involved. 

A previous derivation of (37) was given under less general assump- 


tions,” and the equation was compared with experimental results for 
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element of the spectrum is a current generator 


dwd(we" 4 complex conjugate, (20) 
I 


“B 


where the indicated integration is over the values of w that lie within the 
selected band 
If one writes 


§(t) = A + A*, (21) 
then 9°(t), which is proportional to the instantaneous input power, is 
$°(t) = 2AA* + A® + (A*)’, (2: 


Integration over total time (to obtain a quantity proportional to the 
energy) can be shown to drop out the latter terms, so that 


[ #o a 2 | AA at (23) 


Now, AA* can be written as the double integral 


AA* = [ a | dws’ $(w)9*(w’ Je * 
B 


“B 
Integrating with respect to time gives 


[ Aat at — / dw | des’ (w)I*(w )[276(w —w)| 


~“B “RB 
Qn | dw | 5(w) ’, 
B 
so that 
[ #@ dt 8rB | S(w) ¢ (26) 


If the carrier transits are instantaneous, a forward or reverse transit at 
time é gives a current generator 


g(t) = +q6(t — to), (27) 


where the impulse function 6(¢) has negligible width in time, but its 
integral over time is unity. 
The spectrum of (27) is 
7 l . jwt q 
I(w) = — + gi(t — te dt = +—e 
2r — 20 


2r 
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lor either a forward or a reverse transit 26) becomes 


My) 


The development thus far could be used only to treat the nonlinear 
resistor type of p-n junction. For the general p-n junction, forward-and 
back events must be considered. A forward pulse at ft , followed by a 
reverse pulse at fo + U, gives a current generator 
S(t) q(t — to) — gélt 30) 
( ) 

a [eo - (tote) 31) 
=~ 


S\(w) 


For this case, (26) gives 


F / 
29) 


3°(t) dt = 2¢ B(2 — 2 cos wt ). 32 


The shot noise for each class of carrier event is found by multiplying 
the appropriate integral in (32) by the rate of occurrence of such carrier 
events. Thus, 
2p ts 
2¢B 

q 


27 BRI vo) Po ° 


—_ 2qBi m 


(t reverse 


a forward 

200 

» 9 ; ty , 
(¢ torward uind-back => 2q¢ BR (v9) | (2 = 2 COS wt Pt ) dt 


= 2¢° BR(v)[P(w) + P(w)* — 2Pol. 
The total shot noise is the sum of (33), (34) and (36): 
{ 


(&)rotar = 2qBlis + qR(vo)[P(w) + P(w)* — Pol}. 


For the important special case of classical p-n junction theory, 


can be transformed through use of (11) and (19) into 


(i tkTBG(w) — 2qioB, (34 


total 


} 


where G(w) is the real part of Y(w). This result is easy to remember be- 
cause it has the form ‘‘thermal noise minus shot noise’’. However, it is 
evident from the derivation that no subtraction of one noise from anothe1 
is actually involved. 

A previous derivation of (37) was given under less general assump- 
tions,” and the equation was compared with experimental results for 
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p-n junctions." The good agreement with experiment suggests that 
it is worthwhile to analyze the noise of a p-n junction frequency con- 
verter. 


X. SIGNAL TRANSMISSION IN LINEAR FREQUENCY CONVERTERS 


In a linear frequency converter, the output signal voltage and current 
are linearly related to the input signal voltage and current. 

Of course, frequency conversion by a nonlinear device requires a finite 
local-oscillator voltage. The local-oscillator interaction with the device 
is quite nonlinear. Since the local-oscillator drive is regarded as a power 
supply, not an intelligence-bearing signal, a linear frequency converter 
can be made with a nonlinear device. Linear behavior is obtained when 
the signal voltages are assumed to be arbitrarily small. 

The analysis of signal transmission in the linear frequency converter 
will be made on the assumption that the local-oscillator waveform is 
periodic, with a fundamental frequency 6/27. This assumption is not 
trivial, for nonlinear capacitors can be tuned to oscillate at frequencies 
incommensurate with the driving-oscillator frequency, as well as at sub- 
harmonics of this frequency. The formal results could be generalized to 
cover the nonperiodic case, but most practical situations involve periodic 
waveforms. 

The frequency components of the emission rate @(v) must be related 
to the frequency components of the voltage, (4), applied to the junction. 
The voltage will be given in the form 


v(t) = Vpo(t) of Usia(t), (38) 


where vzo(t) is the large, periodic local-oscillator voltage (plus any de 
voltage) and vgie(t), which will be regarded as arbitrarily small, is the 
sum of all the signal voltages. Here the word ‘‘signal’’ includes the input 
and output signals and the image-frequency signals, all of which are 
linearly related. The method of calculating these linear relationships 


will next be given. The local-oscillator voltage, whatever its waveform, 
may be written 


Vio(t) = > Vio(m)en, (39) 


m 
where m takes on all positive and negative integer values including zero. 
Most frequency-conversion problems can be treated by proper identi- 


fication of s in the following representation for vgie : 


Usiq(t) = Re >. v(mb + set! (40) 
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For example, s/2x could be the intermediate frequency (output) of a 
superheterodyne receiver (Figs. 7 and 8), or the input frequency of a 
transmitting modulator (Fig. 9). All of these circuits and many others 
can be obtained by suitable connections to the multiterminal ‘black 
box”’ indicated in Fig. 10. Negative frequencies are included in the set 
(mb + s)/22; they are the “lower sidebands’’. For the sake of obtaining 
noise-figure expressions, it will be assumed that the input and output 
are at two distinct frequencies (mb + s)/2mr and (meb + s)/27, as indi- 
cated in Fig. 11. In case the input and output are at the same frequency, 
as in a negative-resistance amplifier, the noise-figure expressions will 
not be applicable, but (65) will give the shot-noise current at the fre- 
quency of interest. 

Any information-bearing signal will contain a‘number of frequencies 
(in general, a continuous spectrum). The linear nature of the small-signal 


problem enables one to analyze for each frequency separately. 


Fig. 7 Upper-sideband superheterodyne receiver (down-converter). 


VANISHINGLY SMALL 
SIGNALS 


Fig. 8 — Lower-sideband superheterodyne receiver (down-converter). 
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Since vgiq is arbitrarily small, one can write 
Rlv(t)] = Rlvro(t)] + R’[rro() lesic . (41) 


The de and large-signal de components of ® are given by the Fourier 
expansion of ®[vzo(t)]. The Fourier expansion of &’[v,o(t)] is required for 
the signal transmission, that is, for calculating the conversion matrix. 
Both expansions are necessary to calculate the noise figure. 

Assume that the required Fourier coefficients have been calculated, 


CARRIER 
OSCILLATOR 
w=b 


FINITE 
DRIVE 


AUDIO, VIDEO, OR / TPIT f 
OTHER INPUT SERRE nr JUNCTION ———eeeee 0 PUTT 


w=s A 4 w=stb 


VANISHINGLY SMALL 
SIGNALS 


Fig. 9 Double-sideband transmitting modulator (up-converter) 


est 


ONLINEAR 


, LINEARLY RELATED 
f (FOR SMALL SIGNALS) 





LOCAL 
OSCILLATOR 











Fig. 10 — ‘‘Black box’”’ representation of a linear frequency converter. 
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so that one can write 


Rlvpo(b)] = 2. R(mb)e 


m 


R'[vpo())| = * a’ (mb)eim”t 


m 
Then, one can say that ® is made up of a large periodic part with Fourier 
components @(mb) and a vanishingly-small signal part with Fourier 
components 


R(mb + 8s) = ty R’'[(m — n)blv(nb + s). (44) 


The complex amplitudes of the alternating currents, both large and 
small, can be calculated by multiplying gP(w). First, the large-signal 
current will be called ipo, with Fourier components 


iino(mb) qP(mb)R(mb). (45) 


It should be noted that 7, 9(0) differs from the net direct current, io , by 


eee 


| | 


| | 


| | 
LJ Lo tot bo 


| 





Fig. 11 Linear frequency converter used as an active four-pole network 
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the saturation current, 7, : 
ti0(0) 19 + %,. (46) 


Second, a complete specification of the small-signal transmission of 
the frequency converter is given by 


isic(t) = Re > i(mb + s)emtr’ (47) 


with 


i(mb + s) = qP(mb + s) > R'[(m — n)blv(nb + 3). 


Equation (48) can be written in the form 


i(mb + s) = >. Y(m, n; s)v(nb + 38), 


where 
Y(m, n; 8s) = gP(mb + s)R’[(m — n)b}. 
The array Y(m, n; 8) is called the conversion matrix. For brevity, 
Y(m,n;s) = Yan. (51) 


In operation, the linear frequency converter will be terminated with 
admittances Y,, at each of the ‘terminals’? mb + s. A system of current 
generators J,, at each of these terminals will produce currents and volt- 
ages that obey the relationship 

Sm = tm + Um) mn. 


Theref« re, 


S$. = 2 er + V aden dOn ; 


n 


where dn. is 1 if m equals n and 6,,, isO if m does not equal n. One can 
also write 


tm = Dy AmnIn 5 (54) 
where 


| Xen ae DF an + Y ehhan | (55) 


in matrix notation. 
The effect at terminal m of a current generator J, at terminal n can 
be duplicated by a fictitious current generator n»,J, placed at terminal 
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m, where 


oi ag ae (56) 
Amm 
does not depend on Y,, (Thévenin-Norton theorem). This property 
makes nm, a desirable quantity to use in the analysis so that a necessary 
characteristic of the noise figure — that it be independent of the load 
admittance — will appear quickly and naturally. 


XI. NOISE IN THE GENERAL P-l JUNCTION FREQUENCY CONVERTER 


Suppose that the appropriate value of $(w) for a carrier event is ap- 
plied to each terminal of the linear frequency converter. Then the equiva- 


lent current generator at the output mb -+ s will be 


‘es Nmond (nb + 8). (57) 


This sum can be substituted for 9(w) in (26). For either a forward or a 
reverse transit, (26) and (57) give 


J (t) dt = 2q°B A Nmantngn’t eral oto (58) 


a n,n 


The development thus far could be used to treat the nonlinear-resistor 
type of p-n junction. For the general p-n junction, forward and back 
events must also be considered. A forward pulse at é followed by a re- 
verse pulse at fo + U gives 


/ S(t, t’) dt 


o.2 * j(n’—n)t 
2q B 7 Ruan Pinen’é (1 


n,n 


In all applications of (59), the weighted sum over all possible values 
of t’ will be required: 


[ at P(t) [ #, t) dt = 


2q°B 2. aie "fo P(nb + 8) 
nin 


+ P*(n'b + s) — P(nb — n’b) — Pol. 
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Reverse Carriers 


The thermally generated reverse carriers occur at random but at a 
steady average rate 7,/q. Multiplying (58) by this rate gives 


Og = 2qBi, pe | Nmon ty (61) 


since the terms in (58) for which n does not equal n’ are oscillatory and 
give no contribution to the noise power when averaged for a steady rate. 

Equation (61) may be interpreted as simple shot noise applied to each 
terminal and independently converted to the output (except the term 
for n == m., which represents the direct shot noise at the output). 


11.2 Forward Carriers 


The rate of emission [PoR(t)| of forward carriers varies periodically, 
so that an average of (58) over the local-oscillator cycle is required: 


” b ore 9 j(n’—n)b 
(t )forward = 5 [ dtoP oR (to) [208 + NmanMman’e =a . (62) 
an v—ar/b nin’ 
The Fourier expansion, (42), of ®(4) may be inserted in (62). One then 
finds 


(t’)torwara = 29 BPo : R(nb — n'b) nmgntron’ « (63) 


nn 


11.3 Forward-and-Back Carriers 


In similar fashion, one can average (60) over the local-oscillator cycle 
to obtain the mean-square output current generator for all classes of 
forward-and-return carriers: 

2 9,2 by , + 
(U /forward-and-back > 2q B to R(nb — b) NmanNmaon'|P (nb + 8) 
n" n’ . 
(64) 
, , 
+ P*(nb + s) — P(nb — nb) — Pol. 


11.4 Total Shot Noise 
The sum of (61), (63) and (64) is 


(4°) total ee 2qB 4 ls +p | Nmon 


+ q + R(nb — n'b)nmonMnan'LP(nb + s) + P*(n'b + 8) (65) 


nn 


— P(nb — n'b)]). 
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11.5 [mage Conversion Noise 


The frequency converter is supposed to be terminated at the frequencies 
(mb + s)/2m with specified admittances which may or may not generate 
noise. What noise is generated will be transmitted to the output accord- 
ing to the coefficients Mmm . 

The input and output terminations mb + s and mb + s are in a class 
apart, because noise generated in these terminations is chargeable to the 
transducers that precede or follow the frequency converter. The other 
terminations mb + s, with my # m # ms, will be called images. The 
frequency converter (but not the diode) must be held responsible for 
output noise originating in the image terminations. 

The properties of the image terminations as noise generators depend 
upon the physical nature of the terminations and are not uniquely de- 
termined by their impedances. For example, the conductance G,, for 
each of the image terminations mb + s might consist of a resistor at 
temperature 7’,, . Then the image termination noise at the output would 
be 


9 


, » 
e a .R ™ gy 2 7. 
(t )image terminations > t) B > Twin Nmom | 5 (66) 
m 


where the symbol By indicates that terms in m m, or m me are not 
included in the summation. The terminations may be devices that are 
not in thermal equilibrium. If several diodes are used in combination, 
so that they terminate each other at some image frequencies, the noise 
generated in the mutual terminations can be much greater or much less 
than thermal noise. 


11.6 Noise Figure 
The noise output of a linear transducer must be judged in relation to 
the signal output for a given input signal. As a standard value for the 
input signal, it is conventional to use thermal noise corresponding to an 
absolute temperature of 290°K, which will be denoted 7’, . The thermal 
noise current is 1/4k7',G,B, where G, is the source conductance, and an 
rms current of this value will be the standard input signal. 
The noise figure F can be defined by 
F output noise power due to transducer 67) 
y—j|= : : ~ (Od 
output signal power from standard input signal 
The noise figure of the input stage determines receiver sensitivity if the 
gain of the transducer is sufficiently high. For the frequency converter, 
shot noise at output + image conversion noise at output 


F-l1= , ; . (68) 
output signal power from the standard input signal 
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In place of power, we may use in each case the equivalent mean-square 
current generator at the output, which for the standard signal is 
(¢ )etandard tkT BG, Nmsm, i (69) 
lor convenience, one may define 
_ Amon ay Amen 
Mn = oie 
Nm om mem 
Then the total noise figure may be written 


= 9 7 q a R(nb — n'b) nttn’*|P(nb + gs) 
KL sUs n nf 


+ P*(n'b + 8s) — P(nb — n’b)| + 2, Z | bn 


, 


+ re inn 
m GT’, 

The last term of this equation represents image conversion noise. It 
is an important practical problem to design the circuit to minimize image 
conversion noise. But, in principle, this can always be made to vanish 
by refrigerating the terminations, so the subsequent theoretical discus- 
sion will consider only the shot noise. 


XII. NONLINEAR CAPACITORS 


A nonlinear-capacitor type of p-n junction is one in which all emitted 
carriers are collected within a short time after emission. If all emitted 
carriers are collected, Py is zero, since it is the probability of a carrier 
not being collected. Hence, there are no forward-only carriers. More- 
over, the reverse current 7, of the thermally generated carriers must be 
zero. One may show this by considering the diode in equilibrium, when 
the net direct current must be zero; if there are no forward-only carriers, 
there must be no reverse-only carriers. 

Thus, the only shot noise is that due to forward-and-back carriers, 
governed by P(t’). The exact form of P(t’) is not too important. The 
simple rectangular function shown in Fig. 12 will serve to illustrate the 
main result. The maximum storage time, A, will be allowed to become 
very small compared to 1/w, to approach the case of an ideal nonlinear 
capacitor. The function P(w) is, according to (10), 

A 


Pw) =1-[ xe “fice ete 2! ee 
Jo 
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The shot noise due to forward-and-back carriers is obtained when this 
P(w) is inserted in (71). Each term contains a factor 
/ , , 
P(nb + s) + P*¥(nb + 8s) P(nb — nb) = 


1 j(nb + s)A — 3 j(n'b + 8)A — 4 j(nb — n’d)A 


9 


+2 [(nb + 3)" + (nb + 8)* — (nb — n'b)*| { 
) 


9 


‘ / 
(nb + s)(n b+ 8s) + 
»” 
The first-order terms in A cancel. As A goes to zero, i.e., as the fre- 
quency response of the diode is increased, the shot noise goes to zero as A’. 


; 
+ 











>A 


Fig. 12 — Example of a return probability distribution leading to nonlinear 
capacitor action. 


The capacitance, for a given voltage, is proportional to A. To approach 
an ideal nonlinear capacitor, let A approach zero. At the same time, the 
capacitance waveform can be kept essentially unchanged if the emission 
rate is increased (by appropriate adjustment of the local-oscillator volt- 
age). The shot noise then goes to zero with A and we get this result: A 
frequency converter using an ideal nonlinear capacitor has no shot noise 
output regardless of the local-oscillator waveform, the source and load ad- 
mittances, and the image terminations. 

An interesting qualitative interpretation* of this result is that, as A 
gets smaller, an increasing number of carrier events are required to ob- 
tain a given capacitance waveform. A smoother current results from the 
statistical effect of having a large number of elementary events. 


* Suggested by D. Leenov. 
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If the nonlinear capacitance does not arise from the minority-carrier 
storage mechanism that has been postulated, but is wholly or partly due 
to depletion-layer capacitance, there is even less reason to expect noise. 
The depletion-layer capacitance involves widening and narrowing of the 
space-charge region. It requires a relatively small motion of majority 
carriers instead of their emission all the way across the junction, or 
of any subsequent existence as minority carriers. The frequency re- 
response of Cy should therefore be better than that of minority-carrier 
storage capacitance. Also, the elementary events are likely to be repre- 
sentable approximately as forward-and-back current generators, with 
the area under each pulse being much less than q, since the carriers do 
not go all the way across the junction. 

The problem of finite frequency response (A # 0) can be studied by 
putting the expression given in (73) in (71), to get the leading term in 
the shot noise: 

Pe ee AR(nb — n'b)(nb + 8)(n’'b +8). (74) 

6kT.G, an’ 

It is evident in (74) that F — 1 goes to zero with A if AR(nb — n’b) is 
held constant. But, for a finite value of A, the shot noise could be very 
large in the case of significant conversion from high-order harmonic 
images. Hence, a circuit design consideration is to make yu, small for all 
values of n such that nb is greater than 1/A. Short-circuiting of the 
harmonic images is one way of accomplishing this objective. 

Apart from finite frequency response, actual diodes will fall short of 
ideal nonlinear-capacitor action because of the finite lifetime of minority 
carriers. Some minority carriers will recombine before they have a 
chance to return to the junction. The effect of this recombination is 
equivalent to that of a nonlinear resistor in parallel with a nonlinear 
capacitor having infinite minority-carrier lifetime. As will be shown, 
low-noise frequency conversion is possible with nonlinear resistors when 
the local-oscillator current is sharply pulsed. Therefore, low-noise fre- 
quency conversion is possible with a nonlinear capacitor suffering from 
recombination, but the local-oscillator waveform can no longer be arbi- 
trary, and the conversion gain will be reduced. 


12.1 Nonlinear Capacitor Structures 


Four semiconductor structures shown in Fig. 13 exhibit nonlinear ca- 


pacitance, for reasons to be discussed. The first of these can be made 


from a transistor, and thus is a readily available specimen for relatively 
low-frequency (about 10 mc) experiments with nonlinear capacitors, 
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However, the great promise of the nonlinear capacitor is the possi- 
bility that a diode structure can be made to work at higher frequencies 
than transistors. Transistors, as will be explained below, are not one- 


dimensional structures, a fact that aggravates the series-resistance prob- 


lem. The pt-n-n* and p-i-n and the graded p-n junction can be truly 
one-dimensional structures, as long as skin-effect is not important. Of 


ANE ) 


SYMMETR 



























































S scciatiaianteanictte (d) 


Fig. 13 — Nonlinear capacitor structures: (a) p*-n-p*; | 
(d) graded p-n junction. 
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these, the graded p-n junction seems to be the easiest structure to fabri- 
cate by solid-state diffusion of impurities into semiconductors. 


122 p -n-p’ Structure 


The p°-n-p” structure in Fig. 13(a) can be made by connecting the 
emitter and collector leads of an alloy-junction transistor. A symmetri- 
cal alloy transistor (one with emitter and collector junctions of equal size) 
is theoretically preferred for maximum nonlinear-capacitor action. The 
pains taken by the transistor manufacturer to prevent emitter-collector 
short circuits are, of course, completely wasted in this application. 

The emitter and collector junctions thus become, in effect, one junc- 
tion which almost completely surrounds the n region. Because of the 
high concentration of holes in the heavily doped p* region, compared to 
the concentration of electrons in the n region, nearly all of the emitted 
carriers are holes entering the n region, rather than electrons going into 
the p’ regions. Holes going into the n region may recombine and become 
forward-only events. But if the base width 2w is narrow enough, most 
of the holes will escape recombination, either by returning to the junc- 
tion from which they were emitted or by being collected at the opposite 
junction. In either case, such holes become forward-and-return events 
with respect to the junction, consisting of emitter and collector junctions 
taken together. 

The n region is assumed to be of uniform resistivity and thus to be 
field-free except for the space-charge regions at the junction. Hence, the 
motion of the emitted holes is simple diffusion. The diffusion equation 
(13) then reduces to 


(jw + 1/7)p. = DV’p, . (75) 


If one lets x equal zero at the plane of symmetry indicated in Fig. 
13(a), the solution of (75) appropriate to this symmetry is 


Po © cosh rV/ (jw + 1/7r)/D. (76) 


+w, Py equals p, , SO 


cosh tv (jw + L, t) D 
cosh wv (jw + 1/r) D 


Po = Pw (77) 


The current density is given by qDVp evaluated at x w. Then, by 
comparison with (15), one finds that 
L/L. = V (jw + 1/r)/D tanh wV (jw + 1/7)/D. (78) 


According to (18), 1/Z, is proportional to P(w). For comparison with 
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Fig. 14 — One-dimensional versus two-dimensional bipolar (hole and electron 
flow patterns in junction devices, 


(58), recombination may be neglected by setting 1/7 ~ 0, and (64) can 
then be expanded on the assumption that w is small (i.e., that the fre- 
quency response is high relative to w). One finds 


| L] ww 1 /ww \’ 
P(w) « = + + ++ (79) 
Pe a eee a i 


Hence, the leading terms are proportional to those of (72), with w°/D 
playing the role of the time A in which (nearly) all of the emitted car- 
riers return. 

The postulated condition of neutrality of the n region implies that, 
when holes ure emited into the n region, an essentially equal number of 


electrons must enter the n region from the base contact. A voltage drop 


is required to move these carriers, and this may be crudely described as 
a series resistance. The thinner the base layer is made for the sake of 
high frequency response, the higher the series resistance. This dilemma 
is well known in transistor design. As shown in Fig. 14, the electrons move 
in a direction perpendicular to the average motion of the holes; hence, 
two directions in space are involved. Thus, any one-dimensional analysis 
of the transistor is inherently an oversimplification, although sometimes 
a very useful approximation. 


123 p -n-n Structure 


As an introduction to a structure that is truly one-dimensional, note 
that the plane of symmetry in Fig. 13(a) could be replaced by a surface 
that has the property of reflecting holes. A heavily doped n~ region would 
have this property, and this suggests the p’-n-n* structure shown in 
Fig. 13(b). Equation (78) should apply to this structure as well as to 
that of Fig. 13(a). 
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For each element of p*-n junction area, there is a corresponding area 
of n-n* junction that reflects emitted holes and also provides the elec- 
trons to neutralize the holes. Thus, the area of this structure can be in- 
creased in two dimensions without affecting its performance, except for 
a proportionate lowering of impedance level. This is a characteristic of 
a one-dimensional structure. 

The depletion-layer etching technique has been used to make very 
thin germanium p ‘-n-x structures, where x is supposed to be a region of 


° ° ° 18 rex . , . ° 
high recombination.” The unsolved problem in making this structure is 


that of obtaining a satisfactory x region for nonlinear resistance action 
at high frequencies. No such problem exists in making a p'-n-n” non- 
linear capacitor, since a well-understood n” region is required in place 
of the little-understood x region. 


12.4 p -i-n” Structure 


The p’-i-n’ diode is another one-dimensional structure that has a 
carrier-storage behavior described qualitatively by (78) because the 
basic carrier-storage action is quite similar to that of the p’-n-n” struc- 
ture. Thus, holes emitted from the p* region are reflected at the n’ 
junction. In addition, electrons emitted from the n* junction are reflected 
at the p” junction. 

An accurate analysis of the p’-i-n" structure requires consideration of 
the simultaneous motion of holes and electrons, and would represent an 
advance over present p-n junction theory. In the meantime, the qualita- 
tive considerations mentioned above indicate that the p‘-i-n” structure 
should give nonlinear-capacitance action up to frequencies determined 
by the width of the i region. At much higher frequencies, the structure 
will just be a resistance whose value depends upon the density of car- 
riers in the i region, which density is proportional to the forward direct 
current. 


12.5 Graded p-n Junction 


By drawing a smooth curve through the concentration gradients for 
either the p'-n-n” or the p'-i-n” structures, one obtains the graded 
p-n junction shown in Fig. 13(d). 

The graded junction has features that recommend it as a nonlinear- 
capacitor structure. The electric field associated with the impurity 
gradient tends to reflect emitted carriers the steeper the gradient, the 
larger the field and the higher the frequency response of the capacitance. 
Analysis of the frequency response, based on the approximate kinetic 





SHOT NOISE IN p-N JUNCTION FREQUENCY CONVERTERS 981 


theory of p-n junctions, can apply (if at all) only to an intermediate 
region of bias where emission is significant but not so copious that the 
electric field is altered by the forward current. 

The frequency response of the capacitance will depend on bias, being 
very high for reverse voltages and deteriorating gradually as the voltage 
approaches and enters the forward region. However, microwave experi- 
ments suggest that the series resistance will generally be a much more 
important source of loss and noise than any frequency limitations of 
the nonlinear capacitance. A steep impurity gradient makes for a series 
resistance that is low both in absolute value and in relation to the ca- 
pacitance. However, the capacitance per unit area increases as the 


gradient increases. 


XIII. NONLINEAR RESISTOR NOISE 


The nonlinear resistor type of p-n junction has the property that none 
of the emitted carriers return, except possibly some carriers that return 


in a time negligibly short in comparison to the frequencies under con- 
sideration. Within the frequency range of nonlinear resistor action, the 
admittance due to emitted carriers is real and independent of frequency. 

In spite of the absence of return pulses to cancel the shot current of 


emission, the shot noise delivered to the load can be made very small if 
the local-oscillator current flows in short pulses. The plausibility of this 
result will be argued by representing the p-n junction as a noiseless ad- 
mittance in parallel with a generator of shot current pulses. The shot 
current and the time variation of the admittance are governed by the 
local-oscillator drive. For the nonlinear resistor, the admittance is a pure 
conductance and the shot current to be considered is that of emitted 
carriers that do not return. 

In a frequency converter, the nonlinear resistor is connected to a linear 
passive network which includes the terminations at all the signal fre- 
quencies. If practically all of the shot current flows at a time when the 
junction conductance is very large compared to the network admittance, 
the shot current will be short-circuited through the junction conductance 
and little noise will be delivered to the network. 

Another way of describing the situation is to consider that shot noise 
results because one cannot predict with certainty that a carrier will be 
emitted in any given short time interval. When a local-oscillator voltage 
is applied to a junction, it is known that the sum of the carrier currents, 
in spite of their individual unpredictability, is very nearly equal to a 
definite macroscopic current waveform that can be determined from the 
junction characteristics. When this macroscopic current consists of short 
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Fig. 15 Pulsed local-oscillator current waveform giving partial reduction 
of shot noise. 
sharp pulses, it is possible to predict that many carriers will be emitted 
during the pulses and practically none during other times. Unpredicta- 
bility, hence shot noise, is greatly reduced for a pulsed current waveform 
like the one shown in Fig. 15. 

The principle of using pulsed local-oscillator current can be applied 
in the design of nonlinear-resistance frequency converters. In a simple 
design, for example, it would be preferable to use shunt conductance 
rather than series resistance to attenuate the power from the local oscil- 
lator. More sophisticated mixers may use filters and additional diodes to 
shape the local-oscillator waveform. 

The image terminations chosen below to illustrate the waveform effects 
should not be regarded as the optimum conditions for practical use. For 
example, the main image frequency might weil be open-circuited to 
minimize conversion of thermal noise from the series resistance, while 
short-circuiting of this image will be assumed for mathematical sim- 
plicity (in the case of short-circuited harmonics). 


13.1 Short-Circuited Harmonics 


As an example of a circuit permitting low noise and good signal trans- 
mission, the mathematically simple case of short-circuited harmonics 
will be considered. Another way of defining this case is to say that the 
local-oscillator voltage waveform is purely sinusoidal (plus a de com- 
ponent, vo). Thus, 

v(t) = vp + % cos bt. (80) 


The exponential emission function, (1), will be assumed, so that 
gPoR(v) = te" (81) 


= jePragPrvonnt, (82) 
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The constant 7, is necessarily equal in magnitude to the thermally gen- 
erated reverse current, so that no net external current flows when 1 
equals zero. From (81), 


Br 


gPR'(v) = Bis 
which, with (80), gives 


Brg Svpcost 
€ . 


gPoR'(v) = Bis (84) 


The Fourier expansion of (84) involves modified Bessel functions of the 
. . 16 ry 
first kind, denoted by /, . Thus, 


gP (nb) = Bi,e"'°T,, (Bus). (85) 

From (50) and (85), one finds that the conversion matrix is 
, - Bro 5 vo 
Y(m, n; 8s) = Bi,e’ I m_*,(Bus). (86) 


To take a specific example, conversion between two frequencies dif- 
fering by b/2z will be considered [such as (b + s)/2x and s/2z]. All 
other signal frequencies (images) will be assumed to be short-circuited. 
Then the conversion matrix reduces to 


Y (m,n; s) |! = Blt, + 1%) 


where 


me 1;(Bvs) 


wy = 
y To(Bv) 


and io, the direct current, is introduced in view of the relation 
. . Bu > 
lo = isle °Lo(Bu,) — 1), (89) 


which may be obtained from (11). 

For minimum conversion loss, the four-terminal network represented 
by (87) should be used with source and load admittances that simultane- 
ously match the input and output admittances of the network. These 
admittances are, of course, real for a nonlinear resistor and are found, by 
standard methods,” to be 


Yin = You = G, = Gi = Bie” o(An)V/1 — yx. (90) 


Since the images are short-circuited, all the y,’s defined in (70) are 
zero except 
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l+vVil-¥ 
ling = — 2 y. (91) 
Y 
For the noise figure, algebraic manipulation that will not be detailed 
here gives 
Tu +0 


»? 


F-— 1] 
Tout w 


where 


and 


; 


bo +t 


6 = (94) 
Equation (90) gives the dependence of F, upon the local-oscillator 
drive (represented by u) and the direct current, represented by 6. The 
quantity @ defined in (94) is the ratio of the thermally generated current 
to the current carried by emitted carriers. 
For any value of @, there is a certain value of u, 


, 


u Ve +6é-— 6, 


that gives a minimum noise figure 


r 


F ae 96) 
ee a (90) 

As @ approaches zero, Fin approaches unity. At the same time, the opti- 
mum value of u approaches zero. The optimum loeal-oscillator voltage 
goes to infinity and the conversion loss approaches unity (that is, no 
loss at all). This result shows that low noise figure and minimum con- 
version loss are not incompatible. 

For most practical microwave diodes @ is quite small, since 7, may be a 
tenth of a microampere and the direct current 7p is of the order of a milli- 
ampere. An approximate formula is therefore relevant: 


Fuin = 1 + 2V/0; 6< 0.01. 


. in . mm 
For 6 = 10°, Fmin = 1.02 = 0.1 db. Thus, the observed thermally gen- 
erated currents would not produce much noise and probably are not 
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responsible for much of the observed noise figure of crystals, except pos- 
sibly at elevated temperatures. 

This analysis has considered only the shot noise of the emitted car- 
riers. In actual diodes, the series resistance and the depletion-layer ca- 
pacitance impair the noise figure. The depletion-layer capacitance should 
be essentially noiseless, but, in combination with R, , it sets bounds on 
the usable impedance level of nonlinear resistance of the emitted car- 
riers. With the impedance level thus established, R, causes a definite 
signal loss and contributes thermal noise. Furthermore, PR, interferes 
with short-circuiting the local-oscillator harmonics or otherwise obtain- 
ing the pulsed local-oscillator current waveform essential for low shot 
noise. 


13.2 Open-Circuited Harmonics 


When the harmonies of the local oscillator are open-circuited, the 
local-oscillator current is sinusoidal (in addition to a direct current). It 
will be shown that this waveform is unfavorable for noise, although it 
permits a low conversion loss. 

The emission rate and the local-oscillator current are simply related: 


Vt 
gPoR(t) = trol) + 4, = (to + 4) (1 + + # COS wt), 98) 


where 
0O<Et <1. (99) 


The reason for introducing the quantity & will be apparent later. The 
Fourier analysis of Pof(d) is obtained by inspection of (98): 


qPoR(nb) = % + 4, ifn = 0 
if n= + | 


otherwise. 


Since an exponential emission rate is being considered, ®&’ equals BR 
and, from (50), 


Y(m,n;s) = Bltyo + 2,) ifm=n 


S(to + 2, )E ., 
a — ifm=n+1]1 


otherwise. 
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The admittance representation of a frequency converter lends itself 
readily to consideration of cases where the image frequencies are short- 
circuited, since the elements of Y(m, n; s) corresponding to the images 
may then simply be omitted. One finds, however, that, with open-cir- 
culted harmonics and short-circuited images, the conversion loss is at 
least 3 db, quite apart from the further losses due to the parasitics. As 
an absolute theoretical limit, this conversion loss appears to be ex- 
cessive. In seeking an escape from this limit, one naturally wonders if 
open-circuited images might be appropriate for open-circuited harmonics. 

Indeed, open-circuited images permit unity conversion gain. To deal 
with open-circuited images, we may invert the infinite-dimensional 
matrix || Y(m, n; s) || to obtain the conversion matrix in impedance 
form: 


l—m| 


anit)! 


ob 


¢ 
s 
— 


‘ ( l 
Z(m,n; 8) = ———~ (102) 
B(t0 a 1.) l 

The images can now be open-circuited by considering only the 2 K 2 
impedance matrix pertaining to the signals. This matrix may be inverted 


to give the admittance matrix for open-circuited images: 


' in || 
| Y || = Blto + 1.) || 1 é] 


{fe (103 
ae 


For the conversion between two frequencies differing by b/22, with all 
images open-circuited, one has, in analogy with the short-circuit case, 

Yin = You = Gt = G. = Blin +4)V1 — &. (104) 
The noise figure is found to be 


‘Mee 
i — |] = ; 
. T, 21 


where 
z= V1 — #. (106) 
For any value of z, the lowest noise figure is obtained when @ equals 0, 
that is, when the thermally generated current is negligible. For @ = 0, 


the minimum noise figure is 3.25 = 5.1 db; this occurs when z equals 
0.45 and the conversion loss is 4.2 db. Evidently the noise performance 
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with open-circuited harmonics is inferior to that obtainable with short- 
circuited harmonics. 


XIV. CONCLUSIONS 


The nonlinear capacitance of a p-n junction can be used in amplifiers 
and amplifying frequency converters. To the extent that the junction 
exhibits pure capacitance, it will be noiseless in any of these applications. 
Suitably designed p-n junctions are remarkably good nonlinear capaci- 
tors at microwave frequencies, and preliminary experiments indicate 
that good noise figures can be obtained. 

A pulsed local-oscillator current is required for low-noise frequency 
conversion with nonlinear resistance p-n junctions. This requirement is 


a logical consequence not only of p-n junction theory, but also of earlier 


metal-semiconductor rectification theories that assumed emission of 
carriers across a barrier. 
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Gain and Noise Figure of a 
Variable-Capacitance Up-Converter* 


By D. LEENOV 


(Manuscript received March 31, 1958) 


The performance of a p-n junction nonlinear-capacitance diode as a 
low-noise, amplifying frequency converter is analyzed for the case in which 
the output signal frequency is many times greater than the input signal 
frequency. The diode is represented by an equivalent circuit consisting of a 
time varying capacitance and a constant series resistance. Formulae for the 
maximum available gain and the noise figure of the cireuit are obtained, 
and representative numerical values are given for the noise figures of sys- 
tems incorporating such diodes as preamplifiers. 


I. INTRODUCTLON 


Since microwave receivers frequently must operate at a very low in- 
put signal level it is important to obtain the smallest possible noise 
figure. One way of accomplishing this is to use a low-noise amplifier at 
the input of the receiver. In this article the role of a nonlinear-capaci- 
tance p-n junction diode in performing such a function is analyzed. 

The performance of a variable capacitance as a frequency converter 
has been extensively discussed in the literature.'” Theory indicates 
that when the variable capacitance is used for conversion between a low 
frequency and an upper sideband (a noninverting frequency), the maxi- 
mum available gain is fe/f; , fi; being the input frequency and f, being 
the output frequency. Hence in up-conversion (modulation) a power 
gain results, with the power added to the input signal being supplied 
by the beat oscillator. In down-conversion a power loss results. (These 
statements apply when no signal power is transferred at the lower side- 
band or at any of the harmonic images). The situation is different when 
one of the two frequencies involved is a lower sideband (an inverting fre- 
quency). A negative resistance may then appear at both sets of terminals, 

aaretee in part by the U.S. Army Signal Corps under contraet DA 36-039 
sce-/0224. 
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resulting in regenerative gain or stable oscillations.” Large gains may be 
obtained when going either up or down in frequency. The present paper, 
however, is restricted to a consideration of upper sideband frequency 


conversion. Analysis of lower sideband operation is now in progress. 

In this article the frequency conversion performance of a nonlinear 
capacitance (N-C) diode with parasitics 1s analyzed. In Section V the 
maximum available gain (MAG) is calculated and is compared with the 
MAG predicted for an ideal device. In Section VI, an equation for the 
noise figure of an N-C diode is obtained, taking into account only the 
thermal noise of the series resistance. Over-all noise figures are given in 
Table I for three types of receivers with diode preamplifiers, operating 
at specified input and output frequencies. 


Il. THE EQUIVALENT CIRCUIT 


For the calculations to be made here, the p-n junction might be repre- 
sented by the lumped-parameter equivalent circuit shown in Fig. 1. 
Here R, and Cy are as defined and the “‘variable admittance’, shown as 
lumped parameters, accounts for the current carried across the junction 
by the motion of carriers through the space charge region. A calculation 
of the mixing action of the diode, using Fig. 1, would involve a rather 
complicated formulation.’ However, the silicon and germanium re- 
tarding-field diffused diodes now under investigation are fairly accurately 
represented by the equivalent circuit shown in Fig. 2. (The design and 
properties of these devices are discussed in other papers).”° In this 
diagram, Cmin represents the smallest capacitance attainable, determined 
by the limitation on reverse bias voltage imposed by breakdown, and 
Cmax is the largest value, determined either by physical limitations or by 
choice. 
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Fig. 1 — Equivalent circuit for nonlinear-capacitance diode. 
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Fig. 2 — Simplified diode equivalent circuit. 


Ill. ASSUMPTIONS FOR CALCULATION OF GAIN AND NOTSE FIGURE 


In this article the gain and noise figure of an N-C diode frequency 
converter are calculated according to the following assumptions: (1) 
Upper sideband operation is assumed; that is, an input frequency /; is 
converted to an output frequency fe. using a beating oscillator at the 
frequency fe — fi. (2) The lower sideband, of frequency f. — 2f; , and 
all the harmonic images are assumed to be open circuited, implying that 
no power is delivered by the frequency converter at these frequencies. 
(3) The equivalent circuit of Fig. 3 is used. Here, filters are indicated 
which restrict transmission in the left-hand branch to a narrow band 
about the frequency f; and transmission in the right-hand branch to a 
narrow band about fe ; & and & are signal voltages and &\y; and &y» are 
noise voltages at frequencies f,; and f. respectively. The equivalent cir- 
cuit of the diode which was discussed in connection with Fig. 2 is as- 


sumed applicable over the operating range of pump voltage and frequency. 
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Fig. 3 — Equivalent circuit for the calculation of gain and noise figure. 
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Using these assumptions, results have been obtained in a simple form 
which can serve as a guide to diode development and application. It is 
not supposed that the mode] used here constitutes a very accurate charac- 
terization of the frequency, converter operation. In particular, assump- 
tion (2) is theoretically possible, but there would be considerable prac- 
tical difficulty in satisfying these conditions in a microwave circuit. 


IV. GENERAL EQUATIONS FOR FREQUENCY CONVERSION 


In subsequent sections we shall need to calculate, for the circuit 
shown in Fig. 3, the power delivered to the load at frequency fo when 


specified signal voltages at frequencies f; and fo are applied. Hence, 
what is desired is a means of calculating the current 722 for given signal 
voltages. For the specific applications to be made later on, we shall ob- 
tain in this section a solution in general form for the currents 7; and 2 
resulting when signal (or noise) voltages & and &» , at frequencies f; and 
fe respectively, are applied to an N-C diode. 

It is possible to express the relation between currents and voltages of 
a frequency converter by means of an admittance matrix; this will be 
done here for a nonlinear-capacitance frequency converter. Let the beat 
oscillator voltage be represented by Vo, and the combined signal volt- 
ages by 6V. If noise is of any consequence, one may surely assume 
5V <«< Vo. Hence we may write (1), representing the charge on the 
capacitor as a function of voltage, in the form of a Taylor expansion, 


Q(V) = Qy, + (22) Fs (1) 
Vo 


The subscript Vo signifies that Q or its derivatives have the value deter- 
mined by Vo alone, i.e. 5V equals 0. It is assumed that terms of higher 
order in 6V are negligible, so that the relations to be derived will be 
linear in the signal quantities. (For a more complete theory of linear 
frequency converters, see Ref. 6). 
To evaluate (1) we expand (0Q/dV),, in a Fourier series, 
Ss HCO + Can = ea, (2) 
ay n=—0o 
with the beat oscillator voltage assumed to be periodic, with pulsatance 
(angular frequency) b. The signal voltage 5V may involve all the har- 
monies of the beat oscillator pulsatance mixed with the signal pulsa- 
tance s. Hence it is written 


0 
bV ca ~ > y giar tee (3) 
aoa mb+s . e 
m=—o st 
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Referring to (1), it is seen that the first term on the right-hand side con- 
tains only pulsatances which are harmonics of b. The second term, which 
we designate 6Q, contains the various signal pulsatances. It is given by 


I a iI(m-+-n)b-+e) 
5Q = ( ’) 6V = DD. Cavey” (4) 
Vo 


te) J § nm st 


Since we shall want to select individual pulsatance terms from the right- 
hand side of (4), we set m + n = l. The result is 


yt ae See A Oe teas (5) 


im st 


Also, being interested in the current components, we use the relation 
ty = Jog. , obtaining finally 


Libts = Yi (lb + SC) mUmb+s (6) 


m 


(we have here stopped considering the terms containing —s, because 


ims = t-w+s ). Equation (6) is equivalent to the matrix equation 


tibis = > Y itlaters (4) 
m 


where the elements Y), are given by 
F ii - (lb + syC, m-« (8S) 


The admittance representation is convenient to use when only a few 
signal voltages have specified values and the rest are zero (i.e. short- 
circuited). In the case of the equivalent circuit used here, none of the 
voltages can be short-circuited, because of the diode series resistance. 
The possibility remains, however, of presenting open circuits to all the 
image pulsatances (all the mb + s) except for the two involved in fre- 
quency conversion, s and b + s. This means that all the currents except 
i, and t),. will be assumed to vanish. The relation between voltage and 
current can then be simplified when it is expressed in the impedance 
matrix form. Hence we must obtain the matrix inverse of |) Y ||, which 
is || Z || = || YI. 

There is at least one case in which this matrix inversion can be readily 
performed. If dQ/dV equals Cy + 2C, cos bt then only the coefficients 
Cy and C, in the expansion (2) do not vanish. The matrix of (8) is now 
limited to elements on the diagonal and once removed from the diag- 
onal. As a product of infinite matrices (in these arrays only the nonzero 
elements are indicated), it is written 





THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1958 


j(—b + 8) 
js 
j(b + 8) 
j(2b + s) 


The second matrix on the right-hand side of (9) may be factored to 
obtain (10), 


where ¢ = (1/Co. Equation (9) can be written symbolically as 
Y || = || jb + 8) |] || Cum |], 
and, by a rule of matrix algebra, the impedance matrix is given by 
Z\ = i Yi" ee. j(lb + 8) I". (11) 


The first matrix on the right-hand side of (9) is diagonal, hence its in- 
verse is easily obtained. The inverse of the second matrix is also known." 
If a matrix has the form 


M ;; 


l 
= (Q; otherwise, 
the inverse matrix is given by 


(M')is = =, (—sr". (12b) 


l 
l 


This result can be used to obtain || Cy» ||’, if we set ¢ = &/(1 + &) 
[see (10)]. If it is now assumed that all pulsatances in the spectrum, other 
than b + s and s, are open-circuited, the impedance matrix reduces to 
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Co 1 


jb + 8) 


To simplify future work, we shall use the abbreviated representation 


Zu Zi 
Z. = , (14) 
Zn Zx 
The matrix Z, represents the capacitative impedance of the diode. 
The total impedance Z must include the series resistance R; it is given 
by 


Zuit+Rk Zi. 
Zx Z2+R 
By the definition of an impedance matrix we have 
vy Zuitk Zi. 
Zo Z2o+ Rk le 


Here 7, and vz are the voltages across the diode terminals at the input 
and output frequencies, respectively, and 7; and 72 are the corresponding 
currents. It follows from Fig. 3 that these two voltages are given by 


V1} &1 - 1Z ¢ 
Vo &e ‘ loL | 


(17) 


Substituting (17) into (16), and rearranging, we obtain 


& Zu + R + Le Zi2 


&o Zu Zo + R+ Z, 
The solution for the currents is written in matrix form 


ty Zi + R + Za Lie 
te Zx Z2 +R + Z, 


where the reciprocal matrix is given by 
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Zu + R + Za Z12 


Za Zo + R + Z, 


Zen + Rk + Z, —Z 
l 
D 


—Z Zi — R + Za 
and D is the determinant of the matrix: 
D = (Zu + R + Za) (Zoe + R -+- Z1) ag ZyoZa . (21) 


The output current, and hence the output power, may be determined 
from (19) when the voltages &, and &» are specified. This equation will be 
used as a starting point for calculations of gain and noise figure. 


V. GENERAL EQUATIONS FOR MAXIMUM AVAILABLE GAIN 
We shall calculate the gain of the frequency converter by using the 
condition 


(22) 


where & is the signal voltage. The gain G is defined by 


G power absorbed in Z, | i. |’ Re Z, (23) 
7; = - - = = r ° “a 
power available at input | § |?/4 ReZe 
We shall caleulate G with the aid of (19), (20) and (22), and then deter- 
mine the values of Z,_ and Z, leading to a maximum G, designated by 
Gis . 
We first solve for 7, by substituting the conditions of (22) into (19) and 
obtain 
l 
—— JZn8. (24) 
| tia 
Hence, for the gain we have 


9 9 


| Z “ | & * R,/ D r 


9 


Zu | 


} = 4RoR, | 
| & |2/4Re . th D - 


which, with the aid of (21) becomes 

4RoR, | Zn | 
~ [Zu + RF Zo)Zn + R + Zi) — Zabn 
To maximize G, we first impose the tuning conditions 


jimZeq = —Zn, jim Z, = —Zn. 


G (26) 
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[Note that, according to (13), the quantities Z;, and Z2. are pure imagi- 
nary numbers]. As a result, we obtain 
{RoR, Z> ¥ 
(R + Re)(R + Ri) — ZyZm)? 


\ 


(27) 


Here Rz is Re Z, , Ry, is Re Z,, and Zy2Z> is a real number (since it is 


the product of two imaginary numbers), and we may dispense with the 


absolute magnitude sign in the denominator. 

To obtain Giuax we must maximize the right hand side of (27) with 
respect to Rg and R, . The values of Rg and Rk, satisfying this condition 
are the matching values, and will be designated by Ry’ and R,’. They 
may be obtained by solving the pair of equations 

0G 
. 0, 
OR, 
0G 
— 0. 
ok L 


On solving these we obtain 
9 R°K* + RR, 

R+ R, 

RK’ + RRe 

R +- Re 
where K = V1 — ZyZ,/R*. These equations give the value of Rez 
which maximizes G for any selected value of R, , and correspondingly 
for R,. The matching values R,’ and R,’ satisfy (29) simultaneously. 


Rg 


R L 


They are 
R, = R,’ KR. (30) 


The equality of the input and output matching resistors could have been 
predicted from the form of (27) for G, which is completely symmetrical 
in Re and R, ; i.e. an interchange of Rg and R, , whatever their values, 
does not alter the value of G. Thus, let us suppose that we determine 
R,’ and R,’ experimentally by trial and error, and assume that they are 
a unique pair. Interchanging them must leave G a maximum, hence 
R,’ equals R,’. 

An expression for Gmax May now be obtained by substituting (30) into 
(27). We find, noting that ZZ, equals — rik — i). 


nies = - - r- > — ~ - a 31) 
(RK +1)? — ZudaP RAK + 1? m 


Equation (31) expresses the maximum available gain as a function of 
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the diode parameters # and Chin , and of the time variation of the diode 
capacitance, through the quantities Co and C,. For a given diode the 
first two quantities are assumed fixed, while the last two may be varied 


within limits set by their status as Fourier coefficients of a real, positive, 
periodic function of time. In the Appendix, Giax is evaluated explicitly 
in terms of R and Cnin for the case that Z, is given by (13). The result 


1S 


’ fe l 
Guax , 9) 


Si(a+ VY + 2°) 
where V1 + 1/2? K as introduced in (29), and 
Awl ae R, 


with \ a numerical factor depending upon Cy and C, , and 


& = Vow = Vs(b + 8). 


To simplify future analysis, we write 


where 


l 


as = a figure of merit for the diode. 
2rR.Cmin . 


The expression for x in (33) is convenient for evaluating Gy, from 
(32). We shall first consider low-frequency operation, where f is very 
much less than f,/A. Then x is very much less than 1, and 


Ges ~ fo fi . 


As x increases, Gmax/(f2/fi) monotonically decreases from its asymptotic 
value of 1. This is shown graphically in Fig. 4. Since x = Af2rR,Cwrin , 
the best operation is obtained for minimum values of R,, Cin and XA, 
or & maximum value of f./A. In the Appendix it is shown that, for the 
type of operation under consideration, the minimum attainable value of 
\ is 5.83. 

We next consider high-frequency operation, characterized by 
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f> f./r or x > 1. It is then convenient to rewrite (32) in the following 
form: 


fe 


wel +4 1+! 
‘ = 


As x approaches « , (34) becomes 


' fy f 2 
lim, G nse es Giim = = “Be ( : -) . 
rei tf?  \2nf, 


Since we would like to have G)im as large as possible, it is evident that 


y 
Ginex 


f./X should be large, or, correspondingly, that R,, Cmin and should 
be small, as was true for the low-frequency limit of operation. 

Equation (35) indicates that, for a given input frequency f; , the gain 
should approach a limiting value as fe approaches «. This behavior is 
shown graphically in Fig. 5. It is noteworthy that Gij. decreases with 
increasing f,; , and in particular that, for f; 2 f./2A, Giim < 1. This places 
a fundamental upper limit on the input frequency, if power gain is to be 
obtained. 

The above results show how the maximum available gain of a diode 


amplifier is affected by parasitics. Of greater significance in characteriz- 
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Fig. 4 Ratio of MAG of a diode with parasites to the MAG of an ideal 
variable capacitance, plotted against the parameter z. 
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Fig. 5 — The ratio Gmax/Giim versus z, the input frequency f; being kept con 
stant while the output frequency f2 varies. 


ing its performance is the noise figure, which is discussed in the next 
section. 


VI. GENERAL EQUATIONS FOR NOISE FIGURE 


A p-n junction diode is thought to contain two sources of noise power, 
one being shot noise at the p-n junction, the other being thermal noise 
in the series resistance R of the bulk semiconductor. It has been shown’ 
that the shot noise of the p-n junction of a nonlinear-capacitance diode 
approaches zero so long as the frequency is high enough for recombina- 
tion to be negligible, and not so high that the p-n junction no longer 
exhibits purely capacitative behavior. Hence in the following calculation 
of noise figure only the contribution of the thermal noise of R is con- 
sidered. 
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The noise figure of an N-C diode frequency converter is calculated 
using the equivalent circuit of Fig. 3 and the assumptions of Section 
III, together with the additional assumption that the noise generators 
at the frequencies f; and f. are uncorrelated. 

The noise figure is calculated in accordance with 


noise output originating in therraal noise of R 


noise Output due to thermal noise at 290°K in source resistance 


Thus, as a “standard signal’, one uses the thermal noise output of the 
source resistance, assumed to be at a temperature of 290°K. The noise 
output of the diode is assumed to arise from two components of the 
thermal noise of R. One of these, the thermal noise emf &y;, sets up a 
current of frequency f; in the left-hand loop of Fig. 3, which, by the 
frequency-converting action of the diode, is partly converted into a 
current 22 , of frequency fo ; this then passes into the right-hand loop and 
introduces noise power into the load. The other component, & 2, also 
causes a current 7, of frequency f., to flow through the load. The 
noise power received by the load is determined by these two currents, 
which are assumed to be uncorrelated. 

The noise power can be readily calculated from (19) and (20). The 
current 72 due to &y; is given by solving 


dy Zo + R + Zi —Z i 
l 
D 
} lo —Zo Zi + R + Ze 


Similarly, the current 72 due to &y2 is obtained from 


| ay Zn + R+ Z, —Zy. 
l 
D 
le | — Zo Zi + R sa Za 


(37b) 


Solving these, we obtain 


P Bs 
Sm # 0: = ~D Zubn ; (38a) 


a P 
Ene a 0: te = D (Zi -+- R + Ze)Gue. (38b) 
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The noise power delivered to the load is then: 


a 4kTBRR : ~.i9 oe 13 
(to )R, = D ; =f Zu -- R + Za yy -+- Z2 ys (39) 
! ' 
in accordance with the assumption that the currents in (38) are uncor- 
related. Following (36), the noise figure is given by 


(ig” dR, 


~ 990kBG’ 


(40) 


where 290kB is the available thermal noise of the source resistance and 
G is the gain of the network under the same conditions, defined by (23). 
The gain was calculated in Section V, with the result in (25), given here 
by 

Z ' 


G = 4RaR (41) 


Combining this with (39) and (40), we obtain 


R\|Zu+R+ Zo? +|anl 7 


: 42) 
Re Zo 290 a 


The noise figure here depends upon the diode structure, through R, 
Z\, and Zs, ; upon the beat frequency oscillator waveform and the operat- 
ing frequencies, through Z,, and Z ; and upon the input termination, 
through Z, . (Note that F is independent of the load impedance.) Even 
if all the other quantities are fixed, considerable variation in performance 
can be obtained by varying Z;, . In this presentation, the noise figure is 
calculated for the value of Zg which gives maximum available gain, and 
for the value giving minimum noise figure. 

The noise figure corresponding to the condition of maximum available 
gain is obtained by inserting the appropriate value of Z,_ into (42). In 
Section V this was shown to be Ze = KR.— Zy,. Equation (42) then 
gives 

1R(K +1)? +|Zn)? 7 
F-1l=-— ; : (43) 
K 22 y 290 
On the other hand, from (31) we have 


) 


| Za |" 
RXK + 1)?" 


y 
Giiex 


Hence, (43) may be written 


ie Whe 
Py oe (: T a) 290 ° 
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Before proceeding further, it is interesting to note that (44) can be 
obtained by a simple intuitive analysis, making use of (36) directly. We 
assume that the diode series resistance R generates noise emfs equal to 
V4kTBR at the two frequencies f; and fe. Due to the emf at fre- 
quency fz, the load receives noise power proportional to R. The noise 
power generated at frequency f/f; is also proportional to R; it is converted 
to frequency fe with a gain G, so that the load receives additional noise 
power proportional to GR, making the total noise power from the diode 
proportional to (@ + 1)RT. This is to be compared with the noise re- 
ceived at the load due to a matching resistance at the input. This match- 
ing resistance Re = KR, and the corresponding noise power received is 
proportional to 290GAR. Hence, according to (36), 

_G@+turT 1 (1 es. ) T 
290GKR K Gmnax/ 290° 

Kquation (44) can be expressed in a more useful form by substituting 
an equivalent expression for K. From (34) and (35) it follows that 

Gmex = Gi , (45) 


(1 + K)?° 


Hence 


Giim \" 
K = 2(@: ) — 1. 


Substituting this into (44) gives 


l I T 
si - (Gus ) mi ( ¥ as) 290 ° 


l 


y 
Ginax 


This is the noise figure corresponding to a matching input impedance. 
In using (47), Giax and Gjim may be determined for any set of operating 
conditions, with the help of (84) and (35). Two limiting expressions ob- 
tained from (47) are of interest: 


, Eo | T 
GF ities <K Giim: Ik = l + 5 (¢ - ) (: + ) (48a) 
~ Tlim 


| T 
Gieiex —G im: F — ii l - . ( 18k ) 
a ) _ ( ie a) 290 


Hence, when Gyax is very much less than Gj. , 7 is not much greater 
than 1. When Grax approaches Giim , becomes approximately equal to 
to 2. Thus, even under the most unfavorable conditions, the theoretical 
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noise figure will not be much more thau 3 db if there is a significant 
amount of gain. 

The minimum noise figure may also be found from (42), and applying 
first the tuning condition jXg = —Z, , we obtain 


R (R + Re) + 12Zn\ T 


ne ee 
Re | 22 & 290 


(49) 


The value of Re making F a minimum can be found from (49) by the 
standard procedures of differential calculus. The appropriate value of 
R, , represented by R,”, is 


r 2, 2) 1/2 Ps 
he = ( Z | + Rk). (50) 
Here Ginax is, as before, the maximum available gain, obtained with the 


matching value of Rg. On comparing (31) and (45), one determines 
that 


| Za ‘ tR’Giim « (51) 
Substituting (50) and (51) into (49) gives 


46. +1)" +17T 


2Giim 290 


F ote = l + 
(52) 


T( 1 
~] pr 
sith’ x0 (Vee + og + ) 


The approximate form is valid when Gin >> 1. Equation (52) indicates 
that a noise figure of close to zero db is theoretically attainable. 


It is also important to determine the maximum gain Gy attainable 
under the conditions leading to minimum noise figure, as Gy may be 
needed to calculate the over-all noise figure of a system using a diode 
preamplifier. To obtain Gy , we substitute into (27) the value Rg = Ra” 
given by (50), together with the value of R; that makes G a maximum. 
This quantity, represented by R,”, is found from (29). In the present 
case it is: 

R,” = RIK’ + RRo” (53) 
R+ R,” 
Substituting (53) and (50) into (27) gives Gy . Making use of the defi- 
nition K = VW1 — ZwZo,/R? and | Z|’ = 4R’Giim given in (51), we 
find 


1 Giim 


7 54) 
1+ R/R,” Ro”/R + K? (54) 


Gy = 
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ye . , , . . 1/2 . ° . : » 
Using the relation K = 2(Gjim/Gmax) ~ — 1 given in (46), equation (50) 
and the assumption Gi; >> 1, we obtain the approximate expression: 


1Giim l 


Gu = 49. a Giim \"" *} . (88) 
2VvG im 2 " = l + 
‘ - t (f= 2 Giim 


We now consider two special cases. For Gyax sufficiently small, 


Gin / 
and —— > WGhiim- 


Tmax 


Therefore 


For Ginx = Giim 


(56b) 


The equations obtained for the noise figure have been evaluated for 
certain receiver systems, and the results are given in Table I. The exam- 
ples include, respectively, an N-C diode up-converter followed by (1) a 
travelling-wave tube amplifier operating at 3 kme with a noise figure of 
1.5 db; (2) a point-contact superheterodyne converter plus IF amplifier 
stage operating at an input frequency of 10 kme with a noise figure of 
7.0 db; (3) a system similar to (2) operating at an input frequency of 
55 kme with a noise figure of 10 db (a value that may seem optimistic 
but is probably attainable by combining the best point-contact diodes 
with the best rr amplifier). Values for over-all noise figures of each system 
are obtained from equations (47) and (52), and compared; these corre- 


TABLE I—NolIsE FiGuRES FOR CERTAIN RECEIVER SYSTEMS 
160 kme A = 6.0 T = 290°K 


Over-all Noise Figure of System, 
"system (db) 


Output Noise 


Frequency, fs Figure, F2 Input Frequency, Input Frequency 
(kme) (db) fy = 500 mc f, = 1000 me 


Receiver 
System 


Fa Fg Fa Fe 


51 2.48 
5 2.20 
94 2.10 


(1) 3 1.5 1.40 1.40 y 
(2) 10 7.0 1.24 1.20 . 
(3) 55 10.0 1.20 1.14 


noise figure of system corresponding to maximum available gain for diode. 
noise figure of system corresponding to minimum noise figure for diode. 
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spond to the conditions of maximum available gain and minimum noise 
figure, respectively. The difference between them is not significant in 
most cases. The calculations were made assuming A = 6.0, 7) = 290°K 
and f. = 160 kme, the latter figure being obtainable with recently pro- 
duced diffused silicon diodes. 


VII. CONCLUSIONS 

Equations for the maximum available gain (MAG), and the noise 
figure have been obtained for a nonlinear capacitance diode containing 
parasitics. Because of the series resistance and minimum capacitance, 
the MAG reaches a limiting value as the output frequency increases 
without limit. The thermal noise of the series resistance is also responsi- 
ble for noise output of the diode. In contrast, a perfect variable capaci- 
tance would have its MAG always equal to the ratio of output frequency 
to input frequency, and it would generate no noise. 

The results obtained depend upon the diode cutoff frequency f, , and 
upon the operating conditions through the parameter A. Values of f. up 
to 200 kme have been obtained with recently prepared diffused diodes, 
hence the use of the value f, = 160 kme is not overly optimistic. 
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APPENDIX 
Derivation of the Equation for Maximum Available Gain 


In Section V the maximum available gain was obtained in the follow- 
ing form, from (31): 
r 2 
| Zan | 
R?(K + 1)?’ 


R?. It is convenient to define the quantity x 


(57) 


nus 


where K = ~V/1 — Z;: 
such that K = V1 + 


La 
l 


x - Be nee 


R® 


- 58 
ZyZm (58) 


After (58) has been solved for R’, (57) can be written 


owt ie ve (59) 
7 —Zrln 2K +1) —Zndn (x + V2? + 1)" 
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The impedance matrix elements, obtained from (13), are 
Zi» 
(60) 


Then 


where 


c 
SARC. 62) 
ed + &) @RC, (O02) 
The quantity Giax in (61) is the gain obtained with matching termina- 
tions. It depends upon the diode parameters RP, and Cin , and upon the 
operating conditions through the operating frequencies and the Fourier 
amplitudes Co and C, . The diode parameters are fixed for a given device, 
of course, but Cy and C; may be varied to further maximize G. To demon- 
strate this, we first express the variables Cy and C; in terms of Cyyax and 
Cmin , Where Cin iS a constant, namely, the lowest value of capacitance 
obtainable. [From (69), below, it can be seen that, for any given value 
of Cmax , the gain increases with decreasing Cjn|. We therefore need 
consider only Cinax to be variable. For convenience, in the calculation of 
maximum gain, below, the variable u is used, where p= Cinax/Cmin - 
The process of maximizing G is equivalent to minimizing x. To aec- 
complish this we start with (62) and express the quantities € and Co in 
terms of p and Cin . From (12a) we have 
ne, 
ts Re 


" (63) 
Since C(é) is a sine wave, it follows that 

Co Cite 2( 1 . 
Equations (63) and (64) may be used to eliminate C, , obtaining 


9 


(’ (65) 


)2 mins 
Next, we express £ in terms of uw. As C(¢) is sinusoidal, it also follows that 
jae = Co + 2C’ . (66) 


Combining (64), (65) and (66), we obtain 


Cie = (; ms ) Cain 
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Cmax l ™ % 2 
% sake ‘ (+ ‘) ‘ (67b) 


aaa. 
h+l 


Finally, combining (62), (65) and (68), we get 


Solving for £, we obtain 


we + 1) 
Pe 


ak nits ’ 


u(u + 1) 
uw—1l- 
The minimum value of A is found to be 


A= (1 + V2)° = 5.83 


v1 l + W2. 
The corresponding value of Cinax/Cmin is 


a 2 / 2 Kk of 
—— =p = (1 + +/2)° = 583. (71) 
( min 
Thus, the maximum gain and minimum noise figure are obtained for a 
variation of capacitance over a 5.8-to-1 ratio. In practice, however, the 
dynamic range of capacitance may be ne greater than 3 to 1. This corre- 


sponds to n» = 1/3, and \ = 6.5. Hence, for this limited range the per- 


formance would still not be far from optimum. 


REFERENCES 


1. Manley, J. M. and Rowe, H. E., Some General Properties of Nonlinear Ele- 
ments, Part I, Proc. I.R.E., 44, July 1956, pp. 904-913. 

2. Edwards, C. F., Frequency Conversion by Means of a Nonlinear Admittance, 
B.S8.T.J., 35, November 1956, pp. 1403-1416. 

3. Macpherson, A. C., An Analysis of the Diode Mixer Consisting of Nonlinear 
Capacitance and Conductance and Ohmic Spreading Resistance, N.R.L. 
Report 4667. 

. Hartley, R. V. L., Oscillations in Systems with Nonlinear Reactance, B.8.T.J., 
15, July 1936, pp. 424-440. 

5. Bakanowski, A. E., Cranna, N. G., and Uhlir, A., Jr., I1.R.E.-A.1.E.E. Semi 
conductor Device Research Conference, Boulder, Colo., July 1957. 

>». Uhlir, A., Jr., this issue, pp. 951-988. 





Nonstationary Velocity Estimation 


By T. M. BURFORD 


(Manuscript received January 3, 1958) 


A nonstationary noise may frequently be approximated by the product 
of a stationary noise and a deterministic function of time. From observa- 
tions of the sum of such a nonstationary noise and a linear signal, an esti- 
mate of the rate of change of the signal is found. More exactly, a random 
function, x(t), is assumed to be one of the following: 


a+ bt + g(t)n(t), 


Pe 
a+ bt + g(t) h(t — r)n(r) dr, or 


v— x 


rt 
a+ bt + | h(t — r)g(r)n(r) dr, 


where a and b are constants, h(t) is the impulse response of a lumped parame- 
ter filter, n(t) is white noise and g(t) ts a nonzero deterministic function. 
A least squares estimate of b is found as a lincar operation on a finite sam- 


ple of x(t). 


I. WHITE NOISE CASE 


A well-known estimation problem is that of forming a mean square 
estimate of the constant, 6, in the random function z(t) = a + bt + n(Z), 
where n(é) is white noise and a and b are unknown constants. The esti- 
mate of b is required to be the linear operation 

at 


-_ | El. ~<)de) de, 
7¢t—T 


where K(z) vanishes outside (0, 7’) and 6 is to equal 6 in the absence of 
noise. The solution is known'” to be 
6 


K(z) = T3 


(T — 22): 


= (): elsewhere. 
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The above problem will be generalized here to include one type of non- 
stationary noise. 
Let 


x(t) = a+ bt + g(d)n(b, (2) 


where a and 6 are unknown constants,* n(¢) is white noise of unit spectral 
density and g(t) is a nonzero, deterministic function. Formally, it fol- 


lows that 
Elg(On(Ogtont)| = gOgt)et — t’). (3) 


We wish an estimate of } in the form 


at 


_ K(t, t — r)a(r) dr, (4) 
7t—T 


where K(¢, z) vanishes for z outside (0, 7’). A constraint is that, in the 
absence of noise, (4) should give 6 exactly. Therefore 


t 
b / K(t, t — r)(a + br) dr, 
¥ t—T 


which implies, with a change of variable, that 


~T 


K(i, 2) dz = 0, 


eT 
| 2K(,2) dz = -1. (6) 
“0 
Using (3), (5) and (6) and again changing variables we find that the ex- 
pected error is now 
7 -T 
E(b — b)° = K'(t, zg (t — 2) dz. (7) 
“0 
The minimization of (7) is easily done by the usual variational tech- 
nique, using (5) and (6) as isoperimetric constraints. The variation gives 


Qg°(t — z)K(t,z) +A + we = 0. 
Therefore 


: ' —rv\ — pz 
K(t,z) = 55 ; (8) 
2g°(t — z) 
* The linear a + bt is used here for simplicity. The coefficients of a general 
polynomial may be estimated in a similar way. 
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where \ and uw are undetermined multipliers to be fixed by (5) and (6). 
From (5), 


T 
| K(t, 2) dz 
0 


and, from (6), 
aT 


r T . 2 
: ee j 
| 2K (t, z) dz = = ; . me 
0 240 g(t, Z) 24 g(t — 2) 


Solving the preceding equations for \ and uv and substituting into (8) 
gives 


- A, i zAo ‘ 
(AnA2 _ APP _ 2)’ 


K(t, z) 


(0; elsewhere, 


with 
A;(0) 


which gives the minimized error 


4.9 Ao 
E(b — b)*° = , 
aihtiNe ye meee Fe 


The denominator of A(t, z) does not vanish, since g(t — z) is assumed 


nonzero, and 
7 2 
~ A, 
. Ao 


+—_—_ dz > 0, 
0 grt — 2) 


A. A: 


:— =~ 
Ao 


Ach: — Ar > 0. 


In general, the A(¢, z) found here defines a time variable filter over a 
finite time interval. However, if g is a constant, (9) reduces to (1). An- 
other interesting special case is that in which g is assumed to be of ex- 
ponential form, which implies that 


g(t — z) = g(Hg(—2), 
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and therefore 


1 ff” 2’dez 
g(t) Jo g?(—2) 
Substitution into (9) gives 
B, oa zBy 


K(2) = ‘ Os 
; (BoB _ B,*)g?(—2z) 


= (); elsewhere. 
Now, however, the B;’s are constants instead of functions of ¢, as the 
A ,’s were. For an exponential g, therefore, the filter given by (9) is time- 
invariant. 


The function g(¢) is not necessarily continuous. For example, let 
e <op 
o> B. 


Then 
B<t—T, 


awe as 
as. - - — —— J = ei fae eB ae ef, 
gr jt il Jo" 97317 1 

Be, uy 


= Mis Ps B > f, 
g°jtl 


which implies that K(é, z) is linear but has a discontinuity in value and 
slope at z = 8B. 

Many applications involve a linear g(é). For this case, it is possible to 
plot a one-parameter family of curves which completely describe K(¢, z). 
If g(t) equals a + Bt, we define Q = —(a + #t)/Bt. Then, by a change 
of variable, 


T? 1 1 3’ ds 
oe I (Q +8)? 


and substitution into (9) gives 


hs co 
T’K(t, z) = 


(CoCr — C2) |@ + ( 
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All of the time dependence is now included in Q. Fig. 1 shows 7T°K(é, 2) 
plotted against 2/7 for several values of Q. A positive value of Q implies 
that | g(¢) | is decreasing with time and therefore that recent data are 
superior. Values of Q in (—1, 0) are not permitted, since such a Q would 
imply that g(t) vanished somewhere in (¢ — 7’, ¢). If g(d) is constant, Q 
is infinite and (1) applies. 


14 














Weighting functions, A(t, z), for linear g(t). 
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The A(t, z) just found for the linear g(t) case may be substituted into 
(7) to give the variance of b as a function of Q: 


m 2 
i E(b — b) = E _ (toe ms ') QQ + » | ‘ (10) 


An interesting comparison may be made with the variance resulting 
from using K(z) of (1) in (7), which gives 


a Al 
a 


: 2 7) 
~K(b — b)° = 3(2Q + 1)° + =. (11) 
B* o 


Equation (11) shows the error to be expected from using a K(z), 


which is optimum in the stationary case, on this type of nonstationary 
data. 

Equations (10) and (11) are plotted in Fig. 2. The difference between 
the curves is a measure of the improvement possible* with time-variable 
smoothing. 

As Q approaches ~, (10) may be written 

eee ee ey : (5) 
E(b — b)* = 3(2Q + 1)* — 5 +0 
a2 5 Q 
therefore (10) and (11) will asymptotically differ by 16/5. 

rig. 2 is plotted for Q > 0; however, it may be used for QQ < —1 since 
both (10) and (11) are even on either side of Q = —0.50. 

Fig. 1 also suggests a simple approximate realization for K(/, z). The 
several curves have approximately common intersections near z/7' = 0.2 
and 0.8. Therefore, K(¢, z) might be represented as 

K(t, z) = Ky(z) + f(Q)K2(z), 
where K,(z) is one of the K(¢, z) curves near the middle of the range of 
Q’s to be considered. The function K2(z) vanishes near the points 
2/T = 0.2 and 0.8 and takes on relatively small values elsewhere. This 
approximate realization thus involves two time-invariant filters and a 
multiplier which depends on Q and thereby on ¢. 


Il. PRE-FILTERING OF NOISE 


Filtering and multiplication by g(¢) are not, in general, commutative 
operations, so a choice must be made between 


q(t) | h(t — r)n(r) dr (12) 


* It has been shown by E. N. Gilbert that, even if a linear g(t) vanishes in 
the smoothing interval, the best possible estimate has variance 8*/7' and, there- 
fore, perfect estimation is not possible. 
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Fig. 2 — Variance for linear g(t). 


at 

h(t — r)g(r)n(r) dr (13) 
as noise functions to be added to a + bt. In certain cases, the representa- 
tions (12) and (13) are virtually equivalent in the sense that, given 
h(t — r), there exists an A(t — r) such that 


at at 
g(t) h(t — r)n(r) dr h(t — t)g(r)n(r) dr. 
From the point-wise uncorrelation of n(r), the preceding equation, if it 
is true at all, must be true for all values of +. Therefore 


g(t) 


A(t — 
g(r) 


h(t — 7), 


which implies that g(¢)/g(7) must be a function of (¢ — r). The only 
nonconstant g(¢) satisfying this condition is the exponential. If g(¢) is an 


exponential, the choice between (12) and (13) is arbitrary. In general, 


however, the choice is not arbitrary and depends on physical considera- 
tions. The forms (12) and (13) will now be treated as Cases 1 and 2. 
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Case 1 


The observed quantity, x(¢), is 


t 


a(t) =a+ bt + g() | h(t — r)n(r) dr, 


where n(7) is white noise of spectral density unity. The transform of 
h(t) is assumed to be 


>> B(iw)? 


T{h(O)) = — - m<n. 


> a;(tw)? 


It is also assumed that the numerator and denominator of 7'[h(t)| have 
neither common factors nor multiple roots. The covariance of 


nt 


h(t — r)n(r) dr 


oO 


is defined to be p(r). A particular K(¢, z) is sought such that 
at 


i, = K(t, t _— r)x(r) dr 


7 


minimizes 


dz’ g(t — 2’) K(t, 2’)p(z — 2’). (14) 


¥ . 
E(b — b)’ =| dz g(t — z)K(t, | 
0 


0 


In obtaining (14), it was assumed that, in the absence of noise, 6 
equals b, or 
aT ~T 


| K(t, z) dz = 0 | 2K (t, z) dz = —1. (15) 


Using the constraints of (15), the variation of (14) gives the integral 
equation* 


T 

te te l , / A z 
| dz K(t, z)g(t — z)plz — z) = 7? ‘ (16) 
/0 g(t — z) 


From the form of 7'A(t)|, it is evident that 


dw, 


l r M (w’) iw(z—2’) 
Da Jw N(w*) * 


2n 
A similar equation, of somewhat different origin, has been considered by 
Ule.’ 


= 
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where M(w°) and N(w’) are polynomials in w of degrees m and n. Now 
form‘ the operation N(—d°/dz*) and apply it to (16), giving 


T +0 2 
tr , ; 2 iw , r l A T 
J dz K(t, z )g(t — z) | Mw )e* da N (- ) 
2m Jo ze dz g(t — 


which is, formally, the same as 


9 fT a © ( 2 

tur ’ , ia ' ’ l A Zz 
:){ dz K(t, z )g(t — z ©” dw N (-; ) eke , 
27 J Jo az Zz) 


or, the differential equation 


’ d\ x 2 
M (- ) K(t, z)g(t — z) = N (- 5) =e (17) 
dz , dz] g(t — z) 


The general bounded solution of (17) will not necessarily satisfy (16), 
however. As is well known,’ singularity functions up to order n — m — 1 
should be added at the end points of the interval in order to satisfy the 
boundary conditions of the integral equation (16). 

Therefore, let 


¥ . 1B ? d A+ pwr \, 
K(t, dg(t — 2) = > ae” (2 — 2). -5)(%; Hae 


n—m 


1 n—m—1 
+ 3 bo" (z) + -, cj" (z — T), 


where 6°” is the Dirac delta function, 6" its derivative, ete. The fune- 
tion k(z — x) is the Green’s function associated with M(—(d’/dz*)) and 
the 2m constants m; are the roots of M(—X°*). The a;, 6; and c; com- 
prise 2n undetermined constants. They may be determined by substitut- 
ing (18) into the integral equation (16) and requiring an identity in the 
2n exponentials of p. For example, assume h(x) e “’, which implies 
that 


Mw) 1 
Nw) @& + °° 


Then 


K(t, z)g(t — (« is ) Be + bod(z) + ed(z — T), 


dz 


g(t — z) 


and substitution into (16) gives the following equations for bp and ¢g : 


d\ X\ + uz ? (« 4 “) A + p2 
y= eo —_ yy = 
, dz/ g(t — z)|z=0 dz/ g(t — z) 2 
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Except for evaluating \ and yu from (15), the solution is 


Ati 2) = 


I 2 —- d A + wz 
g(t — 5|2 ) 7 1)(« T £)\>, — 2) 


Case 2 
The observed quantity x(t) now is 
t 
a(t) =act bt + [ h(t — r)g(r)n(r) dr. 


We define 6, K(t, z) and h(t — 7) as in Case 1 and again require that» 
in the absence of noise, 6 equal b, or that the constraints (15) hold. 
These assumptions lead to the following integral equation: 


. 
[ dz’ K(t, z’)@(t — z,t — 2’) =A + pz, 
where 
&(t—2z,t-— 2’) = h(y)h(y + 2’ — 2)g(t — 2’ — y) dy. 
0 


Instead of dealing directly with the integral equation, we first rewrite 
x(t) as 


a(t) = [ h(t — r)[B + yr + g(r)n(r)] dr, (19) 


where 


A aob 


since it is easily shown that 


| h(z) dx = Bo 


ayo 


[ rh(x) dx = Boon ae 


70 ay” ao 
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from the assumed form of 7|A(é)], and that 


at 


| h(t — r)(8 + yr) dr 


ai) 9) 


p | h(x) dx + yt | h(x) dx — ¥ | rh(x) dx 


when the values of 8 and y are substituted from (19). 
A function y(é) is now defined as 
y(t) = B+ yt + gl(t)n(d) 
and a function K(t, z) is defined as the convolution of K(¢, z) and A(z), 
so that (14) is replaced by 


at 

| R(t, t — r)y(r) dr. 

J—2 

The function y(¢) is similar to x(¢) as defined in (2). Therefore, K(¢, z) 
may be found in a manner analogous to that used in the case in which 
the noise was not pre-filtered. Recalling that A(¢, z) is required to 
vanish outside (0, 7’), we may define K(¢, z) more exactly by 


Rit, 2) = | doK(t, he — 0); O<2 <7 


70 


| 


“0 


~7 


dvK(t, vh(z — vv); 2 


0; 


or the equivalent: 


n j m 


‘wa — 
—— R(t, 2) > B 5 Ki(t,z): O 


0 de 
& «> 9, z2<0. 


Since K(t, z) may be discontinuous at z 0 and T, these points have 
been excluded. While A(¢, z) is constrained by (15), it follows that 
K(t, 2) must satisfy 


2 00 


| K(t, 2) dz 
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The variance to be minimized is now 


E(b — 6)? = | R°(t, z)g°(t — z) dz. 


+0 


From (21) 


’ 


K(t, z) = aft. 2 >.T. (23) 
where the s; are presently unknown but will be selected to minimize 
E(b — b)’ and the ¢; are known in terms of the a;. A variation of 


[R°(t, z)g*(t — z) + AK (t, 2) + w2K(t, 2)] dz 
“0 


gives 


2K(t, zg (t —z) +A + uz =0; O<z 


and a set of nm equations defining the s; : 


n—l a a) 
2D. 8; | Few) — 2) dz + (A + pzdeh® dz 
j=0 VT “T 


where 
F=0,1,--:,n — 1. 
Equations (23) and (24), together with the constraints (22), define 
A(t, z) completely. The function A(t, z) may now be found from (21). 
rom the discontinuities in A(¢, z) at z = O and 7, delta functions 
of order as high as n — m — 1 may be expected in K(é, z) at z = 0 and 
T. For example, if h(a) equals e€ “*, (21) becomes 


IK (t, z e ' T 
sacle + aK(t,z) = K(t,z);} 0O<2z2<7 
dz 


s <0, s>T 
and 
> 
From (24), 
—\—pz 


= “ Cigar 
297(t — 2) 


— z)dz+ / (X\ + pze “dd 
JT k 
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which defines s and therefore K(é, z) for z > T. Except for evaluating 
d and uw from (15), the solution is 


K(t, z) = | -&z2-—-T)+art iC : =) 
dz} \g?(t — z) 

s 1 

A+ a(7 + ) 
1 ay of. 


HAZ — 
aH (t, T) 


where 


a 


H(t, T) = é& e™g?(¢ — 2) de. 
JT 
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Amplitude Modulation Suppression 
in FM Systems 


By C. L. RUTHROFF 
(Manuscript received January 8, 1958 


Inadequacies of some limiter concepts are discussed and a limiter analy- 
sis in terms of low-index modulation theory is presented. The analysis of 
a particular circuit proves the possilility of perfect amplitude modulation 
suppression with small loss to the frequency modulated signal. Experimen- 
tal data are presented which verify the theory and demonstrate the practi- 
cability of the device analyzed. 


I. INTRODUCTION 


Since the early days of frequency modulation (FM) it has been known 
that to realize the full advantage of rm with respect to signal-to-noise 
ratio it is necessary to suppress any amplitude modulation (Am) which 
may be present on the signal. Various kinds of limiters have been used 
for this purpose: overloaded amplifiers, grid and plate clippers and 
vacuum diode clippers, to mention a few. In recent years, the trend has 
been to use germanium or silicon diodes for this function, particularly 
in broadband systems such as the TD-2 and TJ microwave systems and 
in systems using transistors. 

There is some ambiguity in the use of the word “‘limiter.’’ In this 
discussion a limiter is a device whose function is to reduce the index of 
amplitude modulation of the input signal. This function is called limit- 
ing or AM suppression and the amount of limiting is the ratio of the 
index of AM in the output signal to the index of am of the input signal. 

The operation of limiters has been only partially understood and as 
a consequence the operation is not efficient either in the sense of loss 
to the FM signal or in the amount of amplitude modulation suppression 
(limiting). 

If it is recognized that limiting is a modulation process, it becomes 
clear that modulation theory can be used to analyze limiter circuits. 
This is done here for a diode limiter. Theoretical and experimental 
curves are presented, as are equivalent circuits. 


1023 
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Il. GENERAL 


Before launching into the modulation theory of limiters, it is first 
necessary to point out some of the inadequacies of some current con- 
cepts. 

It might be surmised that, if the carrier output voltage is plotted as 
a function of the input voltage, the amount of limiting is given by the 
slope of this curve. In Appendix B it is shown that this is not necessarily 
true. In particular, it is demonstrated that the limiting characteristic 
can be altered without altering the carrier transfer function. 

A common “clipper” limiter and its dual are shown in Figs. 1(a) and 
l(b). The customary explanation of limiting by the circuit in Fig. 1(a) 
is accomplished with the aid of Fig. 2. The diodes are assumed ideal in 
the sense that the back resistance is infinite and the forward resistance 
is zero. The bias current /o is assumed to come from a constant current 
source and to bias both diodes in the forward or low-resistance direction. 
The input signal voltage e shown in Fig. 2(a) is a sine-wave amplitude- 
modulated carrier. The output voltage across R; follows the input volt- 
age until the input voltage reaches the clipping level a, Fig. 2(b). At 
this point, diode p,; switches to its back resistance and further increase 
in input voltage does not appear in the output, since the back resist- 
ance of pb, is infinite. Now, when the input voltage is reduced again 
to value A, D, switches to its forward resistance and the output voltage 
is again a replica of the input. A similar sequence is followed on the 


D, A De 
+4 





a 
Lal 


Io 
(CONSTANT) 
\ CURRENT / 


B 











D, D2 


E 
a (Sonera - 


t. T VOLTAGE [* 


Fig. 1 A common idealized limiter circuit (a) and its dual (b). The diodes 
are assumed to have zero forward resistance and infinite back resistance. 
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negative half of the cycle except that, in this case, it is the diode bp. 
that switches at level —a. The net result is an output voltage of the 
form of Fig. 2(c). If the clipping level a were reduced and made 
approach zero, only the zero crossings of the original signal would be 
preserved and the amplitude modulation would be completely sup- 
pressed. 

The above explanation is useful for some purposes, but it has these 
serious limitations: 

It is not obvious from Fig. 2(¢) that any AM remains in the signal 
when, in fact, there may be a considerable amount. The fact that the 
usual envelope structure has disappeared does not prove the absence of 
AM. 

The output is as shown in Fig. 2(c) only if the limiter circuit has 

bandwidth which includes many harmonics of the carrier frequency 
and the baseband frequencies of interest. These harmonics are not negli- 
gible in amplitude and have important effects on limiting. In most 
practical circuits only the fundamental is passed and the simplified ex- 
planation is not accurate enough. The effects of preventing the flow of 


ANIAAN 


VV 





LEVE 


NAA 





Fig. 2 Input and output signals for the limiters of Fig. 1 
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Fig. 3 — Calculated data for ‘‘clipper’’ limiter passing various harmonies of 
the carrier frequency. 


harmonics of the carrier are shown in Fig. 3. This figure gives limiting 


as a function of available input power level. As mentioned previously, 
limiting is defined as the ratio of amplitude modulation index of the 
output signal to that in the input. Curve a shows the usual practical 
case, i.e., where only the band of frequencies about the carrier and the 


baseband frequencies are allowed. All harmonics of the carrier and side- 
bands are suppressed. Curves B and c include all harmonics up to the 
third and fifth respectively. Curve p includes all harmonics of the car- 
rier and sidebands. Curve £ is for diodes with infinite back resistance. 
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This is the case usually analyzed. These curves were calculated from 
formulae derived in Appendix A; the circuit constants used are indicated 
on the figure. 

3. The third limitation has to do with the assumptions concerning 
the diodes. From Fig. 2(c) it can be seen that, if the clipping level is 
reduced (this is equivalent to increasing input amplitude and keeping 
clipping level constant), the AM suppression increases, as does the loss 
at the carrier frequency. In the limit, the Am suppression is perfect and 
the loss to the carrier is infinite. It is commonly thought that, for this 
reason, increasing the input level increases the am suppression. In a 
practical limiter this is not true. One important reason is that the back 
resistance of a diode is never infinite. In Fig. 4, which is similar to Fig. 
2 except that the diodes are assumed to have a finite back resistance, 
note that the output waveform is slightly rounded instead of being 
clipped sharply. The portions are rounded because, even when one diode 
is on its back resistance, the input voltage influences the output voltage. 
In fact, during the clipping interval the output waveform is simply a 
greatly reduced replica of the input waveform in that interval. Now, if 
the clipping level is reduced to zero, the output waveform is exactly 
the same shape as the input waveform. At every instant, one diode is 
on its back resistance and the other on its forward resistance or, in 
other words, the circuit is linear and there is no limiting whatever. The 
loss, however, is finite. This means that there is some one input level at 
which the limiting is best and that, contrary to common belief, merely 
increasing the input level does not necessarily improve the limiting. This 
is also illustrated in Fig. 3. 

For these three reasons, the simple clipping picture does not give a 


suitable explanation of the limiting action. A more accurate description 


OUTPUT SIGNAL 


CLIPPING 
LEVEL 


Fig. 4 — Input and output signals for a clipper limiter using diodes with finite 
back resistance. 
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is given by analyzing the circuit in terms of modulation theory. To 
justify the modulation theoretical approach, a simple argument can be 
given. 

A simple example of an FM signal is a sine-wave modulated carrier 
with low index of modulation. If the index is sufficiently low, only the 
first pair of sidebands is important. The only difference between this 
FM Wave and an AM wave modulated with the same signal is in the phase 
relationships between the sidebands and the carrier frequency. An AM 
wave can be superimposed on the rm wave so that the Am sidebands 
occur at exactly the same frequencies as do the rm sidebands. This 
situation is illustrated as follows: 


AM: H(1 + k cos pt) cos wt = E cos wt 


+ = leos (w + p)t + cos (w — p)il, 


FM: EF cos (. + ~~ sin nt) & E cos at 
Pp 


, Aw 
+ i = leos (w + p)t — cos (w — pti, 
2p 


k«l @«,, 
P 
where Aw is the peak deviation of the carrier 
p is the modulating frequency 
w is the carrier frequency 
k is the am index of modulation. 
The composite Fm signal with AM present is: 


? , Aw 
EK cos wt + E — [cos (w + p)t — cos (w — p)tl 
p 


+k ~ leos (w + p)t + cos (w — pti, 


where the AM and FM sidebands can be identified from (1). 

The purpose of the limiter is to eliminate only those sidebands repre- 
senting AM, i.e., the second pair in (2). Clearly this cannot be accom- 
plished by any linear circuit, since whatever happens to the: am side- 
bands happens also to the Fm sidebands, which are at the same 
frequencies. The limiter must accomplish two things. First, it must 
identify the AM sidebands; second, it must eliminate them. And it must 
do these things without interfering with the rm sidebands. 

The identification can be made with an amplitude detector. Such a 
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detector does not interfere with an Fm wave but does detect whatever 
AM is present. The output of the detector can then be used to generate 
a pair of sidebands in the output circuit similar to the original am but 
opposite in phase. If care is taken to adjust the amplitude properly, 
these remodulated sidebands will completely cancel the original side- 
bands, and the output signal will be free of Am. It is important to note 
that neither the detection nor modulation affects the Fm signal except 
to introduce some loss. The output of the limiter contains only the 
desired FM signal. 

This argument, although heuristic, indicates three important results. 
It shows that limiting is necessarily a nonlinear process. More important, 
it indicates the possibility, at least theoretically, of perfect limiting using 
diodes with finite forward and reverse resistances and with a finite loss 
to the Fr signal. In addition, it points the way to a mathematical ap- 
proach using low-index modulation theory. 

The limiter configuration of Fig. 5, based on the principles discussed 
above, has been analyzed and the details are included in Appendix A. 

| BAND | ! DIODE 2 BAND | 


PAS == 
FILTER 














MESH 1 MESH 2 
DIODES ARE BIASED IN CONDUCTING DIRECTION (LOW RESISTANCE 


cone SHORT CIRCUIT AT CARRIER AND SIDEBAND FREQUENCIES 
FILTER| OPEN CIRCUIT AT ALL OTHER FREQUENCIES 


bene SHORT CIRCUIT AT BASEBAND FREQUENCIES AND DC 
FILTER OPEN CIRCUIT AT ALL OTHER FREQUENCIES 


Cy, Ca AMPLITUDE OF CARRIER CURRENT IN MESH 1,2 
S;, Sa AMPLITUDE OF SIDEBAND CURRENT IN MESH 1,2 
B,,Bs AMPLITUDE OF BASEBAND CURRENT IN MESH 1,2 
I, , lz AMPLITUDE OF DIRECT CURRENT IN MESH 1, 2 
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? 
| 


e =Elcos wt + k/2 [cos w +p)t + cos (w -p)t] 


IS THE INPUT AM SIGNAL VOLTAGE WITH INDEX OF 
MODULATION kK AND MODULATING FREQUENCY P/277 


Fig. 5 — Limiter circuit with which perfect limiting can be obtained. 
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Fig. 6 — Calculated results for limiter of Fig. 5. 


The limiter of Fig. 5 is similar to that of Fig. 1(a), with three important 
differences. In Fig. 1(a), the impedance in branch 4-B is infinite at all 
frequencies; in Fig. 5, this impedance is finite at all frequencies of inter- 
est. Also, the circuit in Fig. 5 is arranged so that the resistance R, , at 
frequencies in a band centered at the carrier frequency, can be controlled 
independently of the resistance R, at baseband frequencies. (The diodes 
are assumed to have finite resistance.) In addition, only two bands of 
frequencies are allowed: a band including the carrier and sideband fre- 
quencies and another band extending from de up to the highest baseband 
frequency of interest. 
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The resistances R, and R, play a vital role in the operation of the 
limiter. By properly proportioning these resistances, the limiting can be 
made theoretically perfect at one or two input levels. If good diodes are 
used (diodes with a high ratio of reverse to forward resistance; most 
germanium and silicon diodes are good by this definition) perfect limit- 
ing can be achieved when R, = FR, . From formulae derived in Appendix 
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Fig. 7 Experimental results for practical limiter based on Fig. 5, with R-. 
adjusted for one limiting peak. 
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A, several curves of limiting versus input carrier level have been calcu- 
lated and are plotted in Fig. 6. For all the curves R, = 10,000 ohms, 
but FR, is different for each curve. For all such data, limiting is defined 
as the ratio of the AM index of modulation of the output to the index of 
modulation of the input signal. In Fig. 6, the curve for R, = 4500 ohms 
shows two points of perfect limiting, and this behavior is verified in 
experimental data presented in Figs. 7 and 8. It is possible to get either 
one or two peaks, depending upon the value of R, . 

The theory indicates that, if two points of perfect limiting exist, the 
phase of the am sidebands reverses at the first point and again at the 
second. That this actually occurs has been verified by experiment. 

Figs. 9 and 10 show similar results obtained for the condition R, = 
R.. Where possible, this is the preferred condition since it requires less 
by-passing of R, and R,, a problem which is especially important in 
broadband systems. 
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Fig. 8 — Experimental results for practical limiter based on Fig. 5, with R, 
adjusted for two limiting peaks. 
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Fig. 9 — Experimental results for practical limiter based on Fig. 5, with 
R, = R, and adjusted for one limiting peak. 


Figs. 6 through 10 indicate that perfect limiting occurs only at iso- 
lated values of input power for this limiter. No practical difficulty arises, 
however, because perfect limiting can be approximated as closely as 
desired over any finite range of input power levels by cascading several 
stages similar to Fig. 5 and arranging the limiting peaks properly. In 
Fig. 6 the hump between the two peaks of perfect limiting can be re- 


duced to any desired level by increasing R.. This will have the effect 
of reducing the separation of the peaks. After choosing any finite amount 
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Fig. 10 — Experimental results for practical limiter based on Fig. 5, with 
R, = R, and adjusted for two limiting peaks. 


of limiting, R, can be adjusted so that the limiting at the maximum 
point of the hump exceeds this value. (The minus sign is usually omitted 
and we say that 40 db of limiting is more than, or exceeds, 30 db, i.e., 
the decrease in AM index is greater for 40 db of limiting than for 30 db.) 
Then, for a continuous range of input levels, the limiting is more than 
the desired amount. Another stage can be adjusted so that its limiting 
peaks fall near but to the right of those of the first stage, in such a way 
that the limiting between the peaks always exceeds the predetermined 
number. Control of bias current and R, will be required for this adjust- 
ment. If this process is continued it is evident that perfect limiting can 
be approximated as closely as desired over any finite range of input 
levels. 

In many practical situations, one stage is sufficient because the acc 
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(@) EQUIVALENT CIRCUIT SHOWING COUPLING BETWEEN THE CARRIER 
AND DC FOR THE LIMITER OF FIG. 5 
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(b) EQUIVALENT CIRCUIT SHOWING COUPLING BETWEEN SIDEBANDS 
AND BASEBAND FREQUENCIES FOR THE LIMITER OF FIG.5 


Yo=(B-A) A, +A Ro =(B-A) B2+A 
pon SIN NA 7 SIN NA27 
Tn = 2(B-A) —y Rn=2(8-A) ——— 
B = BACK RESISTANCE OF DIODE 

A = FORWARD RESISTANCE OF DIODE 


Fig. 11 — Equivalent circuits for limiter of Fig. 5. 
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of the 1F amplifier acts to keep the carrier input level near the value for 
best limiting. 


III. EQUIVALENT CIRCUITS 


From equation (13) derived in Appendix A, equivalent circuits for 
the limiter of Fig. 5 can be drawn. These are shown in Fig. 11. Fig. 12 
is the dual of Fig. 5. In Fig. 11 all resistances with numbered subscripts 
are coefficients in the Fourier expansion of the resistance function of the 
diodes; lower case r’s indicate diode D;. Resistances with letter subscripts 


are as shown in Fig. 5(a). 
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(b) SAME AS (a) EXCEPT CURRENT GENERATOR REPLACED 
WITH VOLTAGE GENERATOR 


L=I [cos wt+k/2 [cos(w+p)t+ cos (w-p)t]] 1S THE 
INPUT AM SIGNAL CURRENT WITH INDEX . 
OF MODULATION K AND MODULATING FREQUENCY P/277 


Fig. 12 — Dual of limiter circuit shown in Fig. 5. 
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The limiting mechanism can be understood by consideration of the 
circuit involving AM sidebands [Fig. 11(b)|. Some am sideband energy is 
coupled from the input mesh | to the output mesh 2 through R, . Some 
of the remaining sideband energy is coupled to mesh 3 through the 
r,/2 where it appears at the baseband frequency. This baseband energy 
in mesh 3 is coupled in turn through —&,/2 into mesh 4, where it is 
still at the baseband frequency. From mesh 4 it is coupled through R,/2 
into the output mesh 2, where it appears at the sideband frequencies 
in phase opposition to the energy coupled through R,. By properly 
proportioning R, and FR, , the energies coupled by R, and —R,/2 can 
be made to cancel, leaving no sideband currents in the output mesh 
and therefore achieving perfect limiting. The Frm signal behaves in the 
same manner as the carrier and suffers loss only in passing through the 
limiter. 


APPENDIX A 


The material in this section is based on Refs. | and 2. In Ref. 1 
Caruthers develops the superposition principle for low-index modulation ; 
i.e., he shows that circuits containing nonlinear resistive elements can 
be resolved into equivalent linear systems using a large carrier and 
small signal amplitude. In Ref. 2 it is shown that a nonlinear resistance 
can be represented by a Fourier series, the coefficients of which have 
the dimensions of resistance. If the carrier is large compared to the sig- 
nal, this resistance is a function of carrier frequency and harmonics 
only and, to a good approximation, is independent of the signal. 

Based on these ideas, the limiter circuit can be solved by usual linear 
circuit techniques. 


s.1 Superposition 


Any nonlinear resistance has a current voltage characteristic that 
can be expressed by a power series of the form: 


at + at + ay +--+ +a" + -ee. (3) 


If a carrier and signal current are applied to this nonlinear resistance, 
each term after the first will produce voltages at new frequencies con- 
sisting of the intermodulation products of the carrier and signal. If 
finite external impedances are presented to these voltages, currents will 
flow at these new frequencies and produce still more new frequencies. 
The number of currents can be minimized by presenting infinite im- 
pedance to some of the modulation products. 

If a carrier current C cos ct and two signal currents S; cos sf and 
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Se COS Sef are applied to the nonlinear resistance the voltage due to the 
nth term of (3) is 

C, = A,(C cos ct + S; cos st + Se cos sot)” 
(4) 


n S 
A,C (co: ct + ao 


S 2 
1 D2 
COS St + ri cos ot) , 


We now make the usual assumption of low-index modulation theory. 
This assumption is that S,/C and S./C are small enough so that 
terms in the expansion of (4) containing products and powers of S,/C 
and S2/C are negligible. The expansion of (4) is then: 


Y 


Ce AC (cos ct ++ ncos” et a cos st + n cos” ct A cos ot) . (5) 
The first term in (5) is the de and the harmonics of the carrier. The 
second and third terms contain the input signal, the output signal (usu- 
ally the first order sidebands) and other unwanted modulation products. 
The important thing to note about (5) is that the input and output 
voltages (second and third terms) are the same whether the input signals 
are applied separately and the outputs summed or whether the inputs 
are applied simultaneously. Therefore, the superposition principle holds 
and is a direct consequence of the fact that the carrier amplitude is 
much greater than the signal amplitude. 


A.2 Fourier Series Representation of Nonlinear Resistance 


The power series expansion for the nonlinear resistance as a function 
of current can be obtained by differentiating (3) with respect to 7. The 
nonlinear resistance is: 


+ Qai + Ban? + --- + nA,” + -- (6) 


where r(z) is the nonlinear resistance. 

The usual assumption made in low-index amplitude modulation theory 
and the one made here is that only the carrier current is significant in 
determining the nonlinear resistance. If the carrier current 7 = J cos wt 
is substituted into (6) and the series rearranged, the result is a Fourier 
series. Such a series can be written as 


r(t) = ro + >> rp, cos nwt. (7) 
n=1 


If the current-voltage characteristics of the nonlinear element were 
expanded in terms of voltage instead of current, an analogous analysis 
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would lead to a representation of the element as a conductance: 


00 
g(t) = go + be Jn COS Nut, (8) 
n=l 
Kither (7) or (8) can be used to represent a nonlinear resistance in low- 
index modulation problems. The accuracy of the representation depends 
upon the index of modulation; the lower the index, the better the repre- 
sentation. 


A.3 Limiter Analysis 


With superposition established and a simple expression for the non- 
linear resistance, the rest of the analysis can be done by usual linear 
circuit techniques. The procedure is as follows: 

i. An amplitude-modulated sine wave voltage is applied to the circuit 
of Fig. 5. No rm need be included, since it is not affected in any way by 
the limiter except to suffer some attenuation. 

ii. A solution is assumed; i.e., currents at certain frequencies are 
assumed to flow in the two meshes. The magnitudes of these currents 
are unknown and are to be determined. Currents at all other frequencies 
are not allowed to flow. 

iii. The mesh equations are set up and solved for pertinent currents. 

With reference to Fig. 5, the input voltage F is a carrier, amplitude- 
modulated with a sine wave: 


e= Ecosat + : E [cos (w + p)t + cos (w — p)él, 


where k = index of modulation 
w/ = carrier frequency 
p/2r = modulating frequency. 
The currents that are assumed to flow are: 
= (cos wt + S, [cos (w + p)t + cos (w — p)t] + Bi cos pt+ 1, 


C2 cos wt + S2 [cos (w + p)t + cos (w — p)t] + B. cos pt + I.. 


(10) 


The circuit of Fig. 5 is arranged so that no other currents are allowed 
to flow. With these currents the 7R drops across the nonlinear elements 
can be calculated. Let the diodes have the following expansions: 


Diode py: ry(t) = ro + bs rn COS Nut, 


n=l] 


Diode be: ro(t) = Ro + Zz (—1)"R,, cos not. 


n=1 
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The factor (—1)” in ro(t) is due to the fact that the diodes are poled in 
opposite directions. The voltage drops across the diodes are found by 
multiplying (10) and (11): 


. ry 
v1 147;(t) = C7 (> -f- 2) cos wt 


-+- rs + . S, + - B,| [eos (w + pit + cos (w — prt 


’ r 
+ (roB, + ryS) cos pt + > Ci + Tiro + Sin cos owt, 


; R. 
Yo joro( t) Ce (1% + ) COS wl 


; Po R 
at. | Ros: 4- = So — = B.| leos (w + pt + cos (w — pt 


(12) 


R, 


+ (RoB» — RSs) COS pl - > C's -- T2Ro _— Tok COS wt. 


The next step is to equate the voltage drops at each frequency to the 
generator emf’s in each mesh: 


K - RC —— RC» + lin; . 
{ Carrier 


0 = —RO + RC, — 1h} 


-V =", + Niro + Ri) — be 


> de 
=F 0, + Tie + R= ER 


) 


= 


KE 
: RmS1 re R Se + = B, 


} sidebands 


R, | 
5 Ba| 


« ) 


Q = 11S + (70 4 R,)B, — RB, 


0 = —RSi + RaS2 — 


> baseband, 


0 = —R,S. — R,B, + (Ro + Rs) Be} 


where 


Rm =to+ = +Re +R 


R, =Ro+ = +R.+R1. 





AMPLITUDE MODULATION SUPPRESSION IN FM SYSTEMS 1041 


The set of equations (13) can be solved for the pertinent currents: 
C; 

> 
v1 


( 
2 I 
E<R.l(ro + Rs)(Ro + Rs) — Re] - z 


+ V (Rater + nRRo ou -) 


(ro + Rs)? 


) 


D , 


/ 


E<RA(ro + Rr)(Ro + Re) — Re] — 2 RRs 


+ V (Ratti + mR-Ro = ae) 


dD . 


- E<RA(ro + Ro)(Ro + Ro) — Rel a 


- _ 


D 


/ 


where 
Ry 
= R.,.(ro + Rs) 


— 7 RARo + Rs) + nRiR.R, 4 (2B) 3 


D = [(ro + Ri (Ro + Ro) — RY WR»AR: — RZ) - 


The insertion loss ratio is 
eo ‘. Ce R, 
Fwetl i? 
tR, 


and the limiting, or amplitude modulation suppression, is 


limiting , ; ‘ (16) 
5 & 

Expression (16) shows that perfect limiting occurs when S, 0. The 
numerator of S. as given in (14) indicates the possibility of perfect 
limiting if R, and R, can be adjusted independently. 

At this point it is convenient to place some further restrictions on the 
diode characteristics so that the calculations can be made more easily. 
The diodes are assumed to switch from a constant forward resistance A 
to a constant backward resistance B. The fraction of the carrier cycle 
that the diodes remain on the back resistance is designated 8; , Be . 
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With these simplifications, the diode characteristics given in (3) are: 


i) 
r(t) = ro + x rn COS nwt 
n=] 


a (17) 
ro(t) = Ro + 2, (—1)"R,, cos not, 
n=l 
where 
ro = (B cae A)p) + A, Ro = (B = A)B» + A, 
- 2(B - 2(B — A) 


’, = ~ A) sin nBy7, R, = sin npor, 
ne nr 
B = back resistance of the diode, 
A = forward resistance of the diode, 
8 = the fraction of the cycle during which the diode is on its 
back resistance. 

All of the currents in (14) are functions of FE, 6; and 82, where £ is the 
amplitude of the input carrier voltage. In order to make calculations 
for (14), relations among EF, 8, and 6, must be derived. 

When the input to the limiter is very low, both diodes are on their 
forward resistances, due to the forward bias current J). At these low 
levels 8; , 82 are both zero and, from (14), 


C2 R. 
en (18) 
C"1 R. + R; 

Because the circuit is resistive and because only the carrier current 
is involved in switching the diodes, (18) holds for all instantaneous loop 
currents less than Jo. From the definition of 8, the diodes switch to 
their back resistances when 

C, cos (—Bir) = Io, C2 Io, 

(19) 
C’, cos (— Bor) = Io, C, = Io. 

At the switching point, both (18) and (19) hold. Therefore, by substitut- 
ing (18) into (19), a relation between 8, , 82 follows: 


R 
-OS f = : ‘OS Pot. 20) 
cos Byr Esa (20) 
In a similar manner, a relation can be derived for F, 6, . When C, = Jo, 
8; = 6. = 0 and, from (14), 


(Rnktz — Re’) 


. = i . of 9) 
im To R, (8; = Be = 0) (21) 





AMPLITUDE MODULATION SUPPRESSION IN FM SYSTEMS 1043 


Now, for any EF 2 Ey, diode pb, switches to its back resistance when 


(RR: — R.| 
0 


E cos Bir = Ey = 1 R. th a le OH 


(22) 
where /o is the bias current in each diode and is given by (13) for 6; 
B. = 0. Thus, 

y 


lie 2R, + A 


, 


where 
V = de bias voltage 
A = forward resistance of diode. 
Curves of power output and amplitude modulation suppression have 
been calculated and are shown in Fig. 6. 


A.4 Circuit Simplification 


A natural question to ask about the limiter is, “Can perfect limiting 
be obtained when R, R.?” If so, the circuit is much simpler. The 
answer to this question is yes, and this can be demonstrated by con- 
sidering the numerator of S,. Perfect limiting is achieved when this 
numerator is zero. Setting the numerator of S, equal to zero and letting 
R, = R,, the resulting expression is solved for R, : 


9. D nk, l 
anette (oe, i s) 
R, = 


: ro + Ro : 
where 
R, = 0. (25) 
Using this fact in (24), it follows that perfect limiting can be obtained 
when R, = R, if the following inequality is satisfied: 
m R l " 
~ a © =, (26) 
2ro 2Ro 2 
Let us examine this further by using the idealized diode character- 
istics of (17). Substituting these expressions into (26), we get 


2(B — A)B, 


(= bir) ae 3 (= ae) | + 
Byr (1 4 A ; ) Bow (1 es A ) 


2(B —A )Be 


This inequality can be met easily with good diodes, i.e., diodes in which 


B/A > 1. 
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Figs. 9 and 10 show a limiter and experimental results for the case 
R, R.. It should be noted that, even with poor diodes, the limiting 
can be perfect if R, and R, are allowed to differ. This can be seen in 
the expression for Sz . 


4.5 The “Clipper” Limiter 


A well-known limiter has the configuration of Fig. 1. The behavior 
of such a “clipper” limiter can be examined by setting FR, R. and 
letting R, and V increase without bound in such a way as to maintain 
the initial bias current J) unchanged. This will also insure that 6, Bs 
and r, R, 

If these exercises are performed on (14) and the results substituted 
into (16), the limiting is found to be 


KE 
Dy -+ 2rilo : 


limiting 
where 
K carrier voltage amplitude, 
Io constant current de bias, 
ry [2(B — A)/x| sin Bir. 


Several remarks can be made concerning the “‘clipper limiter” of (28): 

i. Before the diodes begin to switch, r; = 0 and there is no limiting. 
This is to be expected. 

ii. If J) = O there is no limiting. This too is to be expected because, 
with no bias, one diode has its back resistance in the circuit while the 
other one has its forward resistance in the circuit, and vice versa. The 
circuit is linear under these conditions and cannot limit. 

iii. Expression (28) indicates that the larger r,, the better will be 
the limiting for a given J). But 7, is proportional to (B — A), so this 
indicates a good diode for a limiter of this type is one with a high back 
resistance B and a high ratio of back-to-front resistance. 

iv. If the diodes have a finite back resistance, perfect limiting cannot 
be achieved. Furthermore, as 6; approaches 1/2, the limiting tends to 
zero. The reason for this is that 7; is proportional to sin 8,7, which tends 
to a constant value as 6, approaches 1/2 while F increases without limit. 

The output carrier current can be calculated in the same way as (28). 
It is 


E 4- 2rilo 


‘ 2ro + Te + R, 


(29) 
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Comments can also be made concerning this equation: 

i. At low levels of input voltage the diodes do not switch and r; 
re = 0, and ro A. The output is then Cy E/(2(R; + A)], as ex- 
pected. 

ii. If J, = 0, then £, Bo 1/2, resulting in ro = (B + A)/2 and 

0. In this case Co E/(B + A + 2R,) and the circuit is linear. 

In the above discussion of the ‘‘clipper’’ limiter, all harmonics of the 
carrier were suppressed. Two other cases have been considered: in one 
case, the third harmonic was allowed and, in the other, the third and 
fifth harmonics were allowed (even harmonics cancel). The analysis 
follows the same pattern as previously demonstrated except that cur- 
rents at all allowed frequencies are assumed. Only the results will be 
shown. 

When the third harmonic is allowed, limiting is given by: 


l 


cos Br r3(ro + 14) 
cg 
Ri+AL* 22R, + 2ro + 16. 


Similarly, when the fifth harmonic is added, 


cos Br 
L (: Ty fr, — ral(re + ra)(2R, + 2ro 4 


?Y + A 


(re + rs)(r4 + 7¢)I 
— rsi(re + ra)(re + rs) — (ra + re)(2Ri +: 


Equations (30) and (31) are plotted in Fig. 3. The curve for all harmonics 
was calculated by a Fourier analysis of the ‘‘clipped” wave. 


APPENDIX B 


In this Appendix it will be shown that the amount of limiting is not 
necessarily given by the slope of the carrier transfer function. 

Assume that the limiter of Fig. 5 is adjusted for perfect limiting for 
the carrier amplitude EK = F,. In the equivalent circuits of Fig. 11, it 
is noted that the output carrier current C2 depends upon the de coupling 
through R,. Note further that the sideband current S. depends upon 
the baseband coupling through RF, . 

By proper filtering, the de and baseband currents can be made to 
flow through separate resistors without changing the behavior of the 
limiting or carrier transfer functions. With respect to R, , the equivalent 
circuits of Fig. 11 are then completely independent. If the baseband 
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resistance in Fig. 11(b) is changed, Se, will clearly change. But the car- 
rier transfer function is undisturbed, since no change was made in Fig. 
11(a). Therefore, the limiting is not completely determined by the slope 


of the carrier transfer function. 
This behavior has been verified experimentally using a limiter similar 


to that of Fig. 7. 
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Oxides of the 3d Transition Metals” 
By F. J. MORIN 


(Manuscript received April 16, 1958) 


The magnetic, electrical and optical properties of the 3d metal oxides are 
examined for clues to the energy band structure. A tentative energy band 
scheme is proposed for the oxides of scandium, titanium and vanadium 
which suggests that the nonbonding 3d orbitals of neighboring cations overlap 
sufficiently to form a 3d conduction band. The 3d orbitals do not overlap in 
the oxides of the remaining 3d metals, with the consequence that the 3d 
electrons are in isolated energy slates and electron transport occurs by electron 
exchange between cation neighbors and involves an activation energy. An 
energy level scheme for these latter oxides is caiculated from simple electro- 
statics, and il is shown to agree with the observed electrical properties. Con- 
duction in these oxides includes transport of high-mobility holes in the 2p 
band of the oxygen lattice and of low-mobility holes and electrons in the 3d 
levels of the cation lattice. 


I, INTRODUCTION 


The transition metals combine with other elements to form a host of 
compounds which are electronic conductors and which range in chemical 
bonding from ionic (oxides), through covalent (sulfides, arsenides) to 
metallic (carbides, nitrides). This range in valence character promises a 
great variety of energy band structures and transport processes and offers 
a rich field for theoretical and experimental investigation. No detailed 
determination of band structure or transport has been made for any of 
these compounds, and so they constitute an enormous gap in our knowl- 
edge of the solid state. Indeed, except for the 3d oxides, the semicon- 
ductor properties of transition metal compounds are largely unknown. 
In recent years the use of these oxides in electric! and magnetic? circuit 
elements and in heterogeneous catalysis,’ as well as their importance in 
corrosion’ have emphasized the necessity for a fundamental understand- 
ing of their properties. Now, from our point of view, the most funda- 
mental aspect of a solid is its energy band structure. Therefore, it will 

* This article is a chapter from Semiconductors, edited by N. B. Hannay, to be 


published by Reinhold Publishing Corp., New York, as part of the American 
Chemical Societ y Monograph Series. 
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be the concern of this paper to provide a tentative energy band scheme 
for the 3d oxides. The discussion will be restricted to the simple oxides 
and avoid the more complicated situations to be found in the oxide 
compounds, such as garnets, spinels and perovskites. Because of the 
scarcity of reliable experimental data on single-crystal material, the 
proposed scheme can only be regarded as tentative. At the same time, 
it provides a useful frame of reference for the planning and interpretation 
of future experiments. 

We can begin by considering a relatively simple oxide, for example, 
zine oxide, which is composed of ions having a closed-shell electronic 
configuration. The energy bands in zine oxide arise, to a first approxima- 
tion, from the filled 2p levels of the O- and the empty 4s levels of the 
Zn** which are broadened when the ions are brought together to form 
the solid. The filled 2p band is separated from the empty 4s band by a 
forbidden energy region. Thus, zine oxide is a semiconductor. In the 
same way, a 2p band and a 4s band are expected to arise in the 3d oxides. 
However, it is not easy to see what to expect from the partially filled 3d 
levels of the cations, and we must attempt to answer such questions as (i) 
whether the 3d wave functions overlap sufficiently to form a 3d band 
in any of the oxides or if they do not overlap, so that the 3d levels are 
localized energy levels, and (ii) the location in energy relative to the 3d 
levels (or band) of the 2p band of the anion lattice and the 4s band of 
the cation lattice. 

To learn something about the properties of a 3d band it is instructive 
to consider the energy band situation in the 3d metals. Energy band 
calculations have been made for some of these metals, using the cellular 
and the tight binding methods.° Usually it is found that the 4s band is at 
least 10 ev wide and overlaps the 3d band, which in nickel, for example, 
is about 2.8 ev wide. On the other hand, the 3d band can hold ten 
electrons per atom, while the 4s band can hold only two electrons. This 
difference in numbers accentuates the difference in bandwidth: per 
electron, the 3d band in Ni is only 7's as wide as the 4s band. The narrow- 
ness of the band means a high effective mass and low mobility for charge 
carriers in the band. In the metals, the electric current is carried largely 
by electrons in the 4s band, but these have their mobility decreased by 
transitions to the 3d band. In the 3d oxides, the increased internuclear 
distance can be expected to reduce the 3d bandwidth still further, be- 
cause the overlap of the 3d wave functions will be greatly reduced. This 
will result in a further increase in effective mass and lowering of mobility. 


Later in this paper it is shown that the 4s band probably does not overlap 
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the 3d band in most of the oxides, so that electronic conduction can be 
entirely within the 3d band and thus involve carriers of very high ef- 
fective mass and very low mobility. On the other hand, transport in a 
4s or 2p band, should it occur, will probably be evidenced by the rela- 
tively high mobility of low-mass carriers in a wide band. Results of soft 
X-ray emission indicate the 2p band in oxides to be 10 to 20 ev wide.® 
When the 3d band becomes extremely narrow it is no longer meaning- 
ful to assign a width to the band, and the 3d charge carriers can be con- 
sidered to occupy energy levels localized on the cations. It was first 
pointed out by de Boer and Verwey’ and by Mott’ that this situation 
must exist in many oxides such as CreO;, MneoO;, FexO;, CoO, NiO 
and CuO which are insulators when pure and stoichiometric and have 
room-temperature resistivities exceeding 10'° ohm-cm. In order that 
conduction in the 3d levels can occur there must be present, in pure 
NiO for example, some Ni**+* and Ni* ions (electron hole pair) which 
can move about by exchanging electrons with the normal Ni** of the 
lattice. These excited states, however, can only exist at high tempera- 
tures. Nickel oxide can be made conducting at room temperature by 
introducing Ni*t** ions into the lattice, either by a departure 
from stoichiometry (vacant Ni** sites) or by the substitution of Lit 
ions for Nit* ions. Wandering of the Nit** about the Ni** lattice by 
electron exchange is possible even though the cation separation of 3A is 
twice the cation diameter. Transport of this type is expected to involve 
an activation energy associated with self-trapping of the charge carriers. 
Mobility will be extremely low and increase exponentially with tempera- 
ture. Such behavior is distinctly different from transport in a band where 
mobility decreases with increasing temperature because of lattice scatter- 


ing. Semiconductors in which transport is by electron transfer do not 


show many of the effects which have been used for study of the band 
structure and transport in semiconductors like silicon or germanium. 
For example, no Hall effect, photoconductivity or carrier injection is 
observable, so that transport measurements depend primarily upon 
Seebeck effect and electrical conductivity. 
The oxides series to be considered in this paper is the following: 
'TiO(d) | FeO(d*) | CoO(d")| NiO(ds)| CuO(d’) 
Se2O3(d°) | TizO3(d") | V2 O;(d?)| Cre Os(d*)|Ming Os(d*) | Fez Os(d®) 
'Ti0x(d°) 
The electronic configuration of the 3d shell is indicated. The 4s shell is 
assumed to be empty in all cases. The useful data concerning these oxides 
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are scant and most of them have been taken on ill-defined polycrystalline 
samples. However, from these data we shall try to derive a general 
energy band scheme for the 3d oxides. Particular attention will be paid to 
transport and optical absorption, because these properties give an 
indication of 3d bandwidth. Also, because the majority of the 3d oxides 
are antiferromagnets, any correlation between their magnetic and elec- 
tric properties will be considered. In Section II a brief description of 
their magnetic properties is given. In Section III crystal field theory is 
discussed in a simple way and used to explain the available absorption 
spectra of the 3d oxides. In Section IV it is shown to be probable that a 
3d band having appreciable width exists in the oxides of scandium, ti- 
tanium and vanadium, whereas in the remaining oxides the 3d wave 
functions appear to be localized on the cations. In Section V the relative 
energies of the 4s, 3d and 2p levels are calculated. This energy level 
scheme is shown to fit fairly well the experimental results on NiO (Sec- 
tion VI) and Fe.O; (Section VII). Finally, in Section VIII, a few specu- 
lations are made concerning transport in the localized 3d levels. 


Il. ANTIFERROMAGNETISM® 


The transition metal oxides are antiferromagnetic, a state character- 
ized by an ordered antiparallel arrangement of electron spins. Since the 
electronic structure of a solid underlies its magnetic as well as its semi- 
conducting properties, it is not surprising that experiments on oxides 
have shown a connection between these two properties. Two ways of 
explaining magnetism have been used—the energy band approach for 
transition metals, and the ionic approach for transition metal oxides. 
In transition metals, the magnetic atoms lie close together and there is 
an interaction between the spin moments of neighbors. This exchange 
interaction depends upon the overlap of atomic orbitals. For atoms hav- 
ing a partially filled 3d shell the interaction may be either negative (anti- 
parallel spins) or positive (parallel spins), depending upon interatomic 
distance, and changing from negative to positive with increasing dis- 
tance. It is possible to divide an antiferromagnetic metal crystal into 
two magnetic sublattices, with each atom having nearest neighbors of 
opposite spins. The situation is different in transition metal oxides. In 
these, the magnetic cations are separated by anions. An example of this 
is MnO, whose sodium chloride structure is shown in Fig. 1. It is impos- 
sible for atom A’ to have a spin direction antiparallel to that of all of its 
nearest neighbors, since one half of these B’, C’ and D’ neighbors must 
have parallel spins to give an equal number of up and down spins in the 
lattice. There is one spin ordering, however, in which A’ is antiparallel 
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Fig. 1 The sodium chloride structure and antiferromagnetic spin alignment 
in MnQ., Spins in a (111) plane are parallel, with the (111) planes coupled anti 
parallel to each other. (After Gorter, Ref. 2.) 


to all of its next nearest neighbors; this is shown in Fig. 1, and is the 
one actually found by neutron diffraction. 

But now the antiparallel magnetic ions are separated by anions and 
the problem is to explain how exchange interaction can occur when 
electron orbits do not overlap. One way of approaching the problem 
was suggested by Slater.® Because of the exclusion principle, electrons of 
like spin will tend not to occupy the region between anion and cation. 
That is, electrons with + spin on a cation will repel electrons with + 
spin on the neighboring anion somewhat more than electrons with — 
spin. This repulsion will deform the outer electronic shell of the anion. 
In Fig. 1 it can be seen that anion X, for example, has three Mnt*+ 
neighbors (C’, B, D) in a (111) plane (dotted diagonals) and having + 
spin. Anion electrons with + spin will tend to collect on the side of the 
anion opposite from this plane of cations. This tendency will be aided if 
the (111) plane (solid diagonals) of cations (C, D’, B’) on the opposite 
side of X have — spins, and it may be seen in Fig. 1 that this is the 
‘ase. Thus, the antiferromagnetic alignment between spins in the (111) 
planes represents a lower energy state than does the ferromagnetic 
alignment. A different quantitative description of the coupling between 
electron spins of two cations separated by an anion has been given by 
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Anderson'® and by Yamashita and Kondo." It is called superexchange. 
Since the p orbitals of the anion are directed 180° apart, exchange is 
found to be strongest when cations are on opposite sides of the anion 
and weakest when they are at 90° from the anion. 


At high temperature, thermal vibrations prevent the antiparallel 


ordering of spins, and the material is paramagnetic. The transition 
temperature separating the paramagnetic and antiferromagnetic states 
is called the Néel temperature, 7'y . A plot of 7'y as a function of the 
number of spins per cation is given in Fig. 2 for the oxides which are of 
interest in this chapter. The transition at 7'y is characterized by the 
occurrence of maxima in specific heat, magnetic susceptibility and the 
expansion coefficient. Electron spin resonance disappears below T'y . 
The direction of spin alignment can be determined by neutron diffraction 
and magnetic susceptibility, using single crystals. 

In the rhombohedral sesquioxides TixO;, V2O;, CreO; and FeO; 
the cations are in pairs of closely spaced parallel layers, each pair of 
layers separated from the next pair by a layer of oxygen ions. The spatial 
arrangement of the ions projected onto a plane is shown in Fig. 3. Cation 
layers are represented by solid horizontal lines and anion layers by 
dotted horizontal lines. The heavy lines show how four cations are 
coordinated with each anion. Each cation participates in 18 cation- 
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Fig. 2 — Plot of Néel temperature, 7'y, against the number of spins per cat 
ion for some of the 3d oxides discussed in this paper. 
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Fig. 3 — A projection onto a plane of the spatial arrangement of a group of 
ions in the rhombohedral sesquioxides. Cation layers are represented by solid 
lines and anion layers by dotted horizontal lines. The spin orientations of the 
cation layers are shown for Fe.Q; and Cr.Q;. The cation-anion-cation paths of 
superexchange are indicated by the heavy lines which do not include a 90° angle 
(After Gilleo, Ref. 35.) 


anion-cation configurations, one-third of which are shown in Fig. 2. 
Because half of the configurations include an angle of about 90°, only 
nine configurations are important in exchange interactions. The spin 
alignment shown in Fig. 3 was found by neutron diffraction” and by 
magnetic susceptibility." Spin alignment has not been determined in 
Ti,O; and V.O;. 

A second magnetic transformation is possible in the rhombohedral 
sesquioxides; that is, a change in the direction of spin alignment from 
perpendicular to the three-fold axis, as in Fig. 3, to parallel to this axis. 
According to neutron diffraction,” magnetic susceptibility’ and spin 





1054 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1958 


resonance, this transformation occurs between —30°C and —15°C in 
FeO; . 

Another interesting property of Fe.O; is its weak ferromagnetism,!® 
which persists up to Ty and has been explained on the basis of sym- 
metry considerations.” The internal magnetic field due to the ferro- 
magnetism produces an anomalous Hall effect,'® which depends upon 
the magnetization of the material and disappears above 7'y . This Hall 
effect probably can be understood on the basis of the theory'® for a 
similar effect found in the transition metals. It cannot be used as a 
measure of carrier concentration in the usual way. 

xchange interaction, besides producing spin alignment, also tends 
to move the ions in such a way as to increase the exchange, and thus it 
causes lattice deformation. The net number of antiparallel neighbors 


increases with decreasing temperature because the opposing effect of 


thermal vibration decreases; correspondingly, lattice deformation 
increases with decreasing temperature. At temperatures below T'y , the 
exchange coupling between the (111) planes of spin cause rhombohedral 
distortion in MnO, FeO and NiO and tetragonal distortion in CoO.*° 
Lattice distortion below 7'y is also found in the rhombohedral sesquiox- 
ides. The exchange coupling between the cation layers contracts the 
lattice along the trigonal axis*! but does not change the symmetry. 


III. ATOMIC ORBITALS AND THE CRYSTAL FIELD™ 


If the 3d wave functions in an oxide do not overlap to form an energy 
band but remain concentrated near the cation, the cation, in effect, is 
isolated. It experiences, to a first approximation, only the electrostatic 
field due to the surrounding oxygens. Such a situation exists in the 
hydrated salts of the transition metals, for example the complex 
Ni(H.O),**, where the cation is surrounded octahedrally by six H,O 
molecules with the oxygens (the negative end of the water dipole) nearest 
the cation. Similarly, in NiO, which is a face-centered cubic structure, 
the negative oxygens are also arranged in a regular octahedron about 
the Nit*. In fact, the optical absorption spectrum for NiO is very similar 
to that for the hydrated salt. Therefore, these considerations suggest 
that the same theories might be applied to the 3d oxides as have been 
used in recent years to explain the optical properties of complex salts of 
3d ions. Conversely, this implies that, if a given oxide spectrum can be 
understood in this way, the 3d wave functions in that oxide are con- 
centrated near the cation and do not overlap to form a band. Accordingly, 
we will now see what available spectra indicate as to the possible ex- 
istence of 3d bands. 
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Three theories have been used to account for the properties of the 
complexes: 


1. molecular orbital,”*. 


2. valence bond, 
3. erystal field. 


The advantage of the crystal field theory is that it can give information 
in a simple way about the excited states of the cation. Thus, it furnishes 
an explanation of the absorption spectra and promises to give insight 
into the electric and magnetic properties of the oxides. 

The crystal field theory assumes a purely ionic model and considers 
the effect of the octahedrally coordinated negative anions on the 3d 


electrons of the central cation. The geometry of the cation and its 

surroundings is shown in Fig. 4. Suppose, at first, that the central ion 

has a single 3d’ electron with the quantum numbers / = 2, s = 3. There 
‘ 


are five (2/ + 1) d orbits, all having the same energy in the isolated ion. 
The associated wave functions can be expressed as a function of radius, 
r, multiplied by functions with angular variation of the form xy/r’, 
yz/r?, zx/r?, (x? — y*)/r? and (222 — 2? — 7*)/r*. These orbitals can be 
represented by the charge cloud distributions shown in Fig. 5. The zy, 
yz and zx orbitals are designated as de and the other orbitals as dy. 
When negative anions are on the x, y, z axes, the negative charge clouds 
of the electrons in the dy orbitals, which are directed at the anions, are 


@ CATION 


& ANION 


Fig. 4— A regular octahedron of anions surrounding a central cation. 
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Fig. 5 Angular distribution of the five 3d orbitals 


repulsed by the charge cloud of the anion. Thus, the dy orbitals have a 
higher electrostatic energy than the de orbitals, which are directed be- 
tween the anions, and the ground state of the central ion is split as shown 
by the energy level diagram Fig. 6. This diagram is for an anion ar- 
rangement having cubic symmetry, which occurs in the rock salt oxides, 
and trigonal symmetry, which occurs in the rhombohedral sesquioxides. 
The trigonal case, since it involves a regular octahedron of anions, 
yields nearly the same excited states as does a cubic field, with an addi- 
tional broadening of the absorption bands due to the small splitting of 
the de levels shown. 
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Fig. 6 — The ground state configuration of the five 3d orbitals: when in the 
free ion; in a erystal field of cubic symmetry (as in MnO, FeO, CoO and NiQ); 
in a erystal field of trigonal symmetry (as in the rhombohedral sesquioxides). 
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An important rule concerning the splitting of these levels is that 
their centers of gravity in energy are not altered by the perturbation. 
That is, 


(1) 


Fixing the energy scale to be zero at the ground state of the free ion 
and defining 10 Dg as the energy difference between the two levels 
which a single 3d electron can occupy when subjected to a cubic field 
gives, from (1) 


E, —4Dq, 
E, + 6Dq. 


The relative positions of the excited states above the ground state, as 
determined for ions with electron configurations d' to d® subjected to a 
cubic field, are shown in Fig. 7. To the left in each diagram are shown 
the free ion ground states, labeled with their term symbols. In diagrams 
(c) and (d) are also shown the first free ion excited states having the 
same spin multiplicity as the ground states, and in (b) two excited 
levels for the free ion are depicted. To the right in each diagram are shown 
the ground state and excited states for the ions in the solid, as produced 
by a cubic crystal field of strength Dq. Configuration d*, shown in (b), is 
plotted in a different way (after Orgel*®) from the other configurations, 
because its excited states but not the ground state are split. In these 
diagrams, the first excited state of the ion corresponds to the excitation 
of an electron from the de to the dy levels. For example, in NiO, con- 
figuration d°, the first excited state of the nickel ion is due to the electronic 
transition represented by the change in configuration d*% d’y — d°« d*y, 
which leaves a hole in the normally filled de levels. The energy of these 
two states can be calculated as follows. The ground state, d*e d*y, is 
located at (—4Dq XK 6 + 6Dq X 2) —12Dq. The excited state, 
d°¢ d*y, is located at (—Dq XK 5 + 6Dq X 3) = —2Dgq. These energy 
levels are shown in Fig. 7(d). These transitions became allowed as a 
result of lattice vibrations®* and hemihedral distortion.” 

The available absorption spectra for some of the oxides are shown in 
Fig. 8, together with the location of the absorption maxima of a hy- 
drated salt of the cation in aqueous solution. The close correspondence 
between maxima found in hydrated complex and oxide for Fet+*+ and 
Ni** indicates that the cation is isolated in the oxide as it is in the 


hydrated salt, and suggests that the 3d wave functions do not overlap 


in the oxides of the metals iron ayd nickel. The spectra of Cr.Q;, how- 
ever, show a slight shift to frequencies lower than that of the hydrated 
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Fig. 7 Positions of some excited states above the ground state for ions with 
electron configurations d'! to d*. In each diagram the free ion is to the left and 
labeled with appropriate term symbols, the ion in a cubic field is to the right. 


salt, indicating a slight overlap of the 3d wave functions. This shift 
becomes very large in the spectra of TisO;, probably indicating that the 
cations in this oxide are not isolated as in their hydrated salts, and 
may be due to the overlap of 3d orbitals, with the formation of a 3d band. 

From the spectra, the energy level diagrams of Fig. 7, and the values 
for the free ion excitation energies, X (from Ref. 28), the information 
given in Table I is obtained. The observed frequencies in the oxides, 
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Fig. 8 — Absorption spectra for TixO; , CroO; , FexO; and NiO compared to the 
location of absorption bands for hydrated salts of the corresponding cations. 
(Ti.O; spectra after Pearson, Ref. 32; CroO; spectra after D. S. McClure, unpub 
lished; hydrated salt spectra after Holmes and McClure, Ref. 43.) 


shown in Column 3, were used to calculate the values of Dg in Column 4. 
In turn, these were used to obtain the calculated frequencies in the final 
two columns. 

Another bit of information can be obtained from the absorption 
spectrum of Fe.O; containing titanium, Fig. 8, which shows an addi- 


tional absorption band appearing at 6100 cm~! when one per cent ti- 
tanium is added substitutionally. Titanium is a donor in FesQ; and, in 
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TABLE | SPECTRA OF OXIDES AND HypRATED SALTS 


Type of Frequencies 
Comparison 
of Spectra »1(cm-) Da(cm™) ( 1) v3(cm™') 
exp. 6000 
eale. 600 
aq. 20300 
aq 17800 25700 . 
exp. 16800 21300 > 27000 
cale, 24000 30000 
aq. 17600 24700 
exp. 12200 > 15500 
aq 12600 18200 24500 
exp. 8600 15400 > 22000 
calc, S60 15500 26000 
aq. 8600 14700 25500 


Comparison of spectra of oxides: exp. (measured); cale. (calculated from oxide 
data and diagrams Fig. 7); aq. (measured on hydrated salts). Data for the hy 
drated salts were taken from O. G. Holmes, and D. 8. MeClure.* 

Note: 1 ev = 8100 em™ 


this concentration, it is nearly all ionized to give Tit+*+*+ and Fe** ions. 
This absorption band, therefore, is probably due to the Fe*+ ions, so 
that, from this information, Dg is 610 em~ for the Fe+*+ environment 
in Fe,.O;. Now the value of Dq for Fe+* in Fe(H.O),s** is 1000 em™ 
which can also be taken to be the value of Dq in FeO. The smaller value 


, 


found for the Dq of Fe** in FeO; suggests that the anions surrounding 


the Fe** ion are at a greater distance from the Fe+* ion in Fe.O; than 
in FeO. This is just what one would expect to find if the conduction 
electron (Fet*) in Fe.O; polarized (repulsive polarization) the surround- 
ing lattice. 


IV. THE 3d BAND 


The inner electronic configuration of the 3d metals is that of argon. 
As electrons are added to the argon core to build up the series, the nuclear 
charge is also increased. The electron cloud is thus subjected to an 
increasing nuclear charge but, on the other hand, the mutual repulsion 
of the electrons increases. The over-all decrease in ionic radius going 
from Cat+(r = 0.99A) to Zn*+(r = 0.83A) indicates that nuclear 
attraction predominates over mutual repulsion, and the electron cloud 
becomes contracted. Therefore, if 3d wave functions overlap sufficiently 
to form a band in any 3d oxides, such band formation is to be expected 
at the beginning of the 3d series. Since, in fact, oxides such as CroO; , 
FeO; and NiO normally are insulators, it is in the oxides of Se, Ti and V 
that a 3d band might be expected. In the previous section it was sug- 
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Fig. 9 — The rapid decrease in ionic radius with increasing atomic number for 
the first few members of the 3d series. This implies a contraction in the 3d wave 
functions and, therefore, a decrease in the amount by which they may overlap to 
form a 3d band. (lonie radii from Goldschmidt, Ref. 44). 


gested, on the basis of optical absorption, that there was evidence that 
such might be the case in TipO; . A comparison of the ionic radii of the 
trivalent cations plotted in Fig. 9 shows how rapidly the radius (and 
therefore the extent of the 3d wave functions) decreases with increasing 
atomic number in the first three members of the series. The abrupt 


+ 


increase in the radius of Mn*+++ does not contradict this idea, since it is 
due to the addition of the first dy electron. This, since it interacts with 
the electron cloud of the anion, increases the effective radius of the 


9 


cation.”® Since no transport information exists for Se.O;, the data for 


the oxides of Ti and V will be examined for evidence of a 3d band. 


4.1 TiO, (Configuration d°) 


In TiOs , the electronic configuration of Ti*++*+* is 3d°, so that if a 3d 
band exists we expect it to be normally empty. We have, however, the 
possibilities of supplying electrons to the 3d band by excitation from 
imperfections or from the filled 2p band. Hall mobility of conduction 
electrons has been measured by Breckenridge and Hosler* in ceramie and 
single-crystal samples of TiO, (rutile). The electrons were thermally 
excited from lattice defects introduced by slight reduction of the samples. 
Room-temperature mobility ranged from 0.1 to 1.0 em?/volt-see for all 
samples, the crystals tending to have the higher values. The mobility 


decreased in a normal way with increasing temperature. This mobility 
is 100 to 1000 times smaller than the electron mobility found in ZnO, 
and is precisely what is expected from transport in a 3d band. 
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Fig. 10 — Photoconductivity in stoichiometric single crystal rutile. (After 
Cronemeyer, Ref. 31.) 


Conductivity, photoconductivity and optical absorption have been 


measured by Cronemeyer* on pure stoichiometric single crystals of 
rutile. These data indicate an intrinsic forbidden energy gap of 3.05 ev. 


Above 1200°K, however, the conductivity temperature dependence 
suggests an increase in the energy gap to 3.67 ev. Presumably, the 3.05 
ev represents an electronic transition from the filled 2p band of the oxygen 
lattice to the empty 3d band, which was indicated by Hall mobility to 
be the conduction band. The increase in the energy gap by as much as 
0.6 ev at high temperature, suggests two possibilities: either the 4s band 
of Tit+*** is near the upper edge of the 3d band, and transitions to that 
band are also important, or the 3d band is split into two sub-bands. 
Discussion in Section V shows that the 4s band is probably too high in 
energy to play a role in conduction in the oxides. The second alternative 

splitting of the 3d band into de and dy sub-bands—seems to be the 
most likely. 

The photoconductivity data, Fig. 10, only extend to 3.4 ev, pre- 
sumably not high enough in energy to show the peak of the dy band. 
At room temperature, photoconductivity shows a single absorption 
band having a maximum at 3.0 ev, probably the de band, and a broad 
shoulder at higher energy which may be the beginning of the dy band. 
The width of the main absorption band seems to be a few tenths of an 
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electron volt, which is very reasonable for a 3d band. At 103°K, this 
band gives an indication of splitting; this is not surprising, since the 
cation in the rutile structure is not at the center of a regular octahedron 
of anions. 


4.2 TiO (Configuration d?) 


Besides TiO, , the only oxide for which there is fairly clear evidence 
of a 3d band is TiO. This oxide behaves like a metal, indicating the exist- 
ence of a partially filled 3d band. According to Pearson,*? room-tempera- 
ture conductivity is 3500 ohm~'cm™ and increases slowly with decreasing 
temperature. Conductivity also decreases in a ratio of 7:12 when the 
oxygen content is increased from TiOQpo.; to TiO;.2 . This is to be expected 
if the effect of changing the stoichiometry is to produce an equal change 
in the number of 3d electrons in the 3d band. If all the 3d electrons, 
6 X 10” for stoichiometric TiO, are assumed to be involved in con- 
duction, the electron mobility can be calculated to be np = o/qn = 3500 
(1.6 X 10-'° X 6 X 10”) = 0.36 cm*/volt-sec at room temperature. 
This is comparable with the mobility in TiO» and suggests conduction 
in a 3d band in this case also. 


4.3 TixO; (Configuration d') and V.O; (Configuration d*) 


Since these oxides contain, respectively, one and two 3d electrons per 
cation, metallic conduction is expected if a 3d band exists in them, just 
as in the case of TiO. Conductivity data for these oxides are shown in 
Figs. 11 and 12. The outstanding fact about these data is the large dis- 
continuity in conductivity at 7'y for a number of samples. (The lack of a 
discontinuity in some cases will be discussed later.) No other oxides 
have shown such behavior, and it is this behavior which suggests an 
energy band scheme that explains fairly well the information available 
for these oxides. To begin with, one expects the 3d band to be split by 
the crystal field into the dy and de sub-bands. It is proposed, further, 
that the de band may be split by the magnetic exchange interaction into 
a lower filled band and an upper empty band. Thus, in a pure stoichi- 
ometric sample at temperatures below 7'y , the Fermi level would be 
midway between the occupied de band and the empty de band. When 
the temperature of the sample is raised through 7’y , the bands would 
collapse into a single partially filled band and cause metallic conduction. 
Sample A of V.O; behaves in this way. If it is assumed that carrier 
mobility does not change very much on going through 7'y , and that the 
conductivity change is due only to the change in carrier concentration 


resulting from the collapsing of de bands, the amount of discontinuity 
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Fig. 11 — Conductivity of TisO; through the Néel temperature. (Sample A, 
Foéx and Loriens, Ref. 45; sample B, Pearson, Ref. 32.) 
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Fig. 12 — Conductivity of V.O; through the Néel temperature. (After Foéx, 
Ref. 46.) 
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to be expected in the conductivity of a pure stoichiometric sample can 
be estimated. This will be simply a factor of exp (2e/kTy), where Ey is 
taken to be the slope of the conductivity curve below Ty . For VO; 
sample A, this factor is exp (0.2/8.65 & 10~° & 170) 8 X 10°, which 
is very close to the measured discontinuity. For Ti,.O; sample A, the 
factor is exp (0.11/8.683 & 10-° X 470) 15, again very close to the 
observed discontinuity, although this sample does not show metallic 
conduction above 7'y . A remarkable point about these two cases is that 
the Fermi energy is changed at the discontinuity by an amount con- 
siderably greater than the exchange energy kT7'y . 

If a sample departs from stoichiometry several effects can be imagined 
which would work to obscure the discontinuity: the odd, or missing, 
cations in the lattice will tend to decouple the spin system; they may 
break up the periodicity of the lattice and disrupt the 3d band; they 
will alter the conductivity by acting as donors or acceptors. Samples 
A and B of TisO; and V.O; sample B probably show these effects in 
varying degrees. With the band disrupted, transport probably occurs as 
in oxides such as NiO, and metallic conduction does not appear above 
Ty. The presence of donors or acceptors increases the conductivity 
below 7'y and minimizes the effect of the magnetic transition on the 
conductivity. 

An estimate of electron mobility for V2O; sample A above T'y gives 
wh = o/qn = 60/(8 X 102 « 16 X 10°) = 5 X 10° em?/volt-sec. 
Thus, mobility in a single crystal may be of the order of 10~°em?/volt- 
sec, Which is more than an order of magnitude less than in TiQs so that 
the 3d band in V.Q; is probably not much greater than kT in width at 
temperatures above T'y . 

It seems reasonable to conclude from the considerations and results 
of this and the previous section that a 3d band exists in the oxides of 
scandium, titanium and vanadium, which probably goes to essentially 
zero Width in chromium oxide. A schematic of this idea is shown in 
Fig. 13. 


V. RELATIVE ENERGIES OF THE 48, 3d AND 2p LEVELS 


We saw in the last section that, of the 3d oxides, probably only those 
of Se, Ti and V are like ordinary semiconductors in that their charge 
carriers are in energy bands. When we come to Cr2O; and the remaining 
3d oxides we find an entirely new situation. The 2p and 4s bands are 
quite comparable to the bands in ZnO, for example, and can be repre- 


sented on the usual one-electron energy diagram. However, the energy 


of conduction electrons and holes in the localized 3d levels must be 
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Fig. 13 — Schematic showing how 3d bandwidth goes to zero and effective 
mass, m*/m, of charge carriers increases as 3d wave functions contract with in 
creasing atomic number. 


represented in a different way, since they are in excited states of the 
cation. 

It is possible to find, in a very approximate way, the relative energies 
of the unbroadened 4s, 3d and 2p levels in an oxide by using data from 
the Born cycle, and spectroscopic data for the free ions and neglecting 
lattice polarization about an electron or hole. In Fig. 14, NiO is taken as 
an example to show how this is done. 

Charge carriers are produced by the following reactions, in which the 
electronic states represented are separated by large distances in the 
erystal: 

O--(2p*) + Nit+(3d*) > O-(2p5) + Nit(3d®), 
Nit+(3d*) + Nit+(3d8) — Nit*+(3d7) + Nit(3d?), 
NI*++(3d’) — Nit*3d74s Nit++(3d’7) + 4s electron, 
V-Nit++(3d’) + Nit+(3d’) -— V-Nit*+(3d*) + Nit**(d*), 
Lit Nit**+(3d") + Nit*+(3d’) — Lit Nit*(3d8) + Nit*+(3d*). (8) 


These reactions represent: (4) the formation of a hole in the 2p band and 
an electron in the 3d levels; (5) the formation of an electron-hole pair 
in the 3d levels; (6) the excitation of an electron to the 4s band and 


formation of a hole in the 3d levels; (7) the trapping of an electron 
at a lattice vacancy acceptor with the formation of a hole in the 3d 
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oO” Nitt om { a U 
Fig. 14 — Lattice of NiO with Ni*** acceptors near a cation vacaney and a 


substituted Lit, Ni* electron and Ni*** hole, and a O7 hole in the 2p band 


levels; and (8) the trapping of an electron at a Li* acceptor site with the 
formation of a hole in the 3d levels. Consider first the energy necessary 
to remove an electron from O-~ and place it on a distant Ni** in the 
lattice, as shown by reaction (4). The total electron affinity of an oxygen 
atom (O going to O-~), determined by the Born cycle, is repulsive and 
equals —7.5 ev.* The affinity of O for the first electron is +2.2 ev;* 
hence the negative affinity for both electrons is due to the affinity for 
the second electron (O- — O-~), which therefore amounts to —9.7 ev. 


This energy is plotted in, Fig. 15 relative to the 2p* level plus a free 


electron on the left of the energy diagram. The ionization potential of 
Nit(8d°) and the other spectroscopic data come from Ref. 28. This 
ionization potential is 18.2 ev, and it is plotted relative to the 3d° level 
in the diagram. (These energies are plotted with reversed signs, since in 
the process being investigated an electron is to be taken from O-~ and 
placed on the Ni**, and this is the reverse of the process for which the 
energies are quoted.) When the ions are brought together to form the 
lattice, the potential at the position of a positive ion due to all the other 
ions in the lattice is negative, and that at the position of the negative 
ion it is positive. As a result, the O~~ level is depressed and the Nit 
level is raised (as shown in the diagram) by the amount of the Madelung 
potential® of NiO, which is 23.8 ev. Thus we find these levels moved to 
—14.1 ev (2p*) and +5.6 ev (3d°). The difference in energy of these two 
levels, 19.7 ev, is the energy of reaction (4), and it was arrived at by 
considering only free ion potentials, which are a measure of the energy 
necessary to remove an electron from a free ion out to infinity. In this 
process, the electron would experience the potential of the ion all the 
way to infinity. In the lattice, however, the process is carried out in a 
dielectric, so the energy required for the reaction is reduced by the di- 
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Fig. 15 Energy level diagram calculated for NiO and showing the energy 
required for the reactions (4) through (7). 





electric constant of NiO. Taking this constant to be about 5, the sepa- 
ration of the two levels is thus reduced to 19.7/5 = 3.9 ev, as shown on 
the diagram to the right in Fig. 15. This energy is probably reduced 
even further by broadening of the 2p levels into a 2p lattice band, so 
that the energy required to produce a free hole in the 2p band and an 
electron in the 3d levels is expected to be less than 3.9 ev, in NiO, 


The next process of interest, given by reaction (5), is to take an electron 
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from a Ni** and place it on another Ni**+ far removed in the lattice. 
The 3d° level is located at —36.2 ev, with respect to the 3d’ level in the 
left of the diagram. Adding to this the Madelung potential raises it to 
—12.4 ev. This lies 18.0 ev below the 3d* level calculated in a similar 
manner in the preceding paragraph. In the lattice, therefore, the transi- 
tion will require 18.0/5 = 3.6 ev. Thus, on the right-hand side of the 
diagram the 3d° level is plotted 3.6 ev below the 3d* level which had 
been located in the first calculation. 

Reaction (6) represents the transition of an electron from a Ni** into 
the 4s band. The first step in calculating the energy required for this 
process is to take an electron from the ground state, 3d°, to the first 
excited 48 state, given by 3d’4s. The location of this excited state with 
respect to the ground state is plotted to the left of the diagram, at 
— 29.6 ev, and in the center, at —5.8 ev, by adding the Madelung po- 
tential, which places it 6.6 ev above the 3d° level. Thus, it required 6.6 
ev to produce this excited state of the cation, which is a bound hole- 
electron pair. The additional energy required to separate this pair 
(essentially to remove the electron from the field of the cation) is given 
approximately by ¢/ea, where ¢ is the dielectric constant and a is the 
distance between cations. This energy amounts to 1.0 ev, so that the 
energy required to free an electron from a cation and place it in a band 
of 4s states is 7.6 ev. This is plotted with respect to the 3d° level at the 
right of the diagram. Due to broadening of the 4s levels into a band, the 
actual separation will be reduced to something less than 7.6 ev. 

Consider now the process (7), in which excess oxygen in the NiO 
lattice introduces vacant Nit* lattice sites, which act as acceptors and 
produce conduction holes in the 3d levels. The relative location in energy 


of such an acceptor level has been calculated by de Boer and Verwey,’ 
although they neglected the effect of the dielectric constant of the solid. 
Their argument is as follows. Each vacant Ni** site is surrounded by 


twelve Nit* ions, and the situation to be considered is that in which two 
of these ions yield an electron each to form Nit+* ions that compensate 
for the absence of the Ni**+ ion. This is shown in Fig. 14. Each Nit** is 
surrounded by a normal lattice minus one Nit* at a distance av 2/2, 
plus one positive charge (Nit++) assumed to be at av/2, the greatest 
separation from the first Ni***. Hence, the potential at.one Ni*+** site 
amounts to the normal Madelung potential plus 2¢@/aVv/2/2 — 
g/avV/2 = 23.8 + 9.8 — 2.4 = 31.2 ev. Thus, the level of this ion is 
raised 7.4 ev above that of the normal 3d° ions, as shown in Fig. 15. 
Considering the dielectric constant, the energy required to transfer an 
electron from a distant Ni**+ ion to a Ni+**+ next to a vacancy will be 
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7.4/5 = 1.5 ev. This level is plotted to the right in Fig. 15 with respect 
to the 3d° level. On this same basis, in reaction (8) a substitutional Li* 
will produce a Ni*** acceptor level at 0.5 ev above the 3d° level. 

The calculated energies involving four of these reactions are shown by 
vertical arrows to the right of the diagram. They will be used in Sections 
VI and VII as guides in analyzing the data for NiO and Fe.O;. It will 
be seen in Section VI that these calculated energies are probably larger 
than the actual energies found in NiO by a factor of approximately two. 
Because of this, it is suggested that the dielectric constant of NiO is 
nearer to ten than to the value of five assumed in these calculations. 

The Madelung potential is about the same for all the monoxides CuO 
through TiO. However, as we go from CuO to MnO, the second and 
third ionization potentials of the cation become progressively smaller. 
This raises the 3d levels in energy with respect to the 2p band. In MnO, 
for example, the 3d° level is about 4.5 ev above the 2p band, as compared 
with a separation of 3.9 ev for the 3d° level in NiO, (assuming a dielectric 
constant of five in both cases). Therefore, conduction by 2p holes may be 
less important in the oxides which are earlier in the series than is NiO. 

Madelung constants are not available for the individual ions in the 


rhombohedral oxides where anions and cations do not occupy equivalent 
lattice sites. Therefore, the relative positions of the 3d levels and the 2p 


band in these oxides cannot be determined easily. 

The location (as given in Ref. 22) of the 3d’4s state with respect to 
the ground state in the free ion indicates that the 4s band becomes 
farther removed from the 3d levels as the charge on the cation is in- 
creased. Thus, the 4s band probably plays no role in conduction in any 
of the oxides having cations with a charge +2 or greater. The energy 
of the 3d—3d transition is given by the difference in ionization energy 
of the ground states, 3d" and 3d"*', divided by the dielectric constant 
of the oxide, as shown above for NiO. The only sesquioxide of interest 
here for which spectroscopic data are available is Cr.O; (which is one 
that does not have a 3d band), for which the interval 3d* — 3d‘ is 18.6 
ey. This is about the same as 3d* — 3d* in NiO. 

The simple oxides, then, when pure and stoichiometric, can be expected 
to show conduction involving 3d holes and electrons and 2p holes, as a 
result of the processes shown in (4), (5) and (6). These are the processes 
corresponding to intrinsic conduction arising from excitation across the 
forbidden gap in the semiconductors discussed in earlier chapters. They 
will probably appear to be p-type, because conduction by high-mobility 
2p holes will predominate over the conduction by low-mobility electrons 
in the 3d levels. On the other hand, when an oxide is impure or departs 





OXIDES OF THE 3d TRANSITION METALS 1071 


from stoichiometry, as in (7) and (8), its type will depend, in addition 
to this intrinsic behavior, upon the particular valence state of the cation 
and upon the valence states available to the cation. It was shown above 
how excess oxygen or substitutional Lit in NiO introduced Nit*** ac- 
ceptor levels near to the 3d° levels. In TiO», a departure from stoichi- 
ometry produces Tit+** ions which are donors with respect to Ti**** ions 
and introduce donor levels near to the 3d band. In general, when the 
cations are in their lowest valence state, a defect will be such as to cause 
the oxide to be p-type and, when the cations are in their highest valence 
state, a defect will be such as to cause the oxide to be n-type. Oxides such 
as Fe,O;, in which the cations are in an intermediate valence state, 
can be made to go either into n-type or p-type. However, at high tem- 
peratures even impure oxides will tend towards p-type, because of con- 
duction by high mobility 2p holes. The energy levels discussed in Section 
III, of course, exist in addition to those shown in Fig. 15, but they are 
excited states which do not contribute to conduction. 


VI. NICKEL OXIDE, NiO 


Much more information is available for NiO than for the other cubic 
oxides, so it will be considered in detail. The structure of NiO has been 
described in Section IT, and a possible electronic energy level scheme has 
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Fig. 16 — The resistivity of NiO as a function of Li content. (Sample o, Morin, 
Ref. 37; sample x, Verwey, Ref. 47.) 
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been discussed in Sections IV and V. Nickel oxide is a p-type semicon- 
ductor, apparently even when pure and stoichiometric, and no n-type 
form has been reported. Room-temperature resistivity for pure NiO 
is approximately 10 ohm-em. This can be lowered to about 1 ohm-cm 
by the addition of Li, as shown in Fig. 16. 

The transport data available for NiO are from Wright and Andrews,** 
taken on oxidized pure Ni films, and Morin,” taken on sintered bars 
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Fig. 17 — The conductivity of NiO (samples 1, B; and E) and of NiO contain 


ing Li (samples 4, 5 and 6; see Fig. 16). (Samples 1, 4, 5 and 6, Ref. 37; samples 
B, and E, Ref. 36.) 
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containing different amounts of Li and also presumed to vary in oxygen 
content. The data are shown in Figs. 17, 18 and 19. Samples | and B, , 
judging from the method of preparation (see Refs. 36 and 37), their 
color (pale green) and their electrical behavior, are considered to be the 
purest and most nearly stoichiometric. Conductivity and thermoelectric 
effect of the other samples approaches that for these at high tempera 
tures. This occurs in a way typical of semiconductors when impurity 
centers become completely ionized and intrinsic conduction predomi- 
nates. In the intrinsic range, both Hall effect and thermoelectric effect 
remain p-type. 

According to the calculations of Section V, the energy level diagram, 
Fig. 20 can be constructed. Three parallel conduction processes are 


expected for NiO, from reactions (4) and (5): 61, high-mobility 2p 


holes; a2 , low-mobility 3d holes; and a; , low-mobility 3d electrons, so 
that total conductivity is given by 


g=o+02.+ 063. (9) 


Carrier concentration will be given, choosing /; = 0 as the zero of energy, 














90 600 
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Fig. 18 — The Seebeck effect of NiO (samples 1, B: , and E) and NiO contain 
ing Li (samples 4, 5 and 6; see Fig. 16). (Samples 1, 4, 5 and 6, Ref. 37; samples 
B, and E, Ref. 36.) 
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Qam*kT 


p = of ) exp (— E,/kT), (10) 
l 


p2 = N exp[— (Ep — E,)/kT), (11) 
nz = N exp[— (E; — E,)/kT], (12) 


where N is the number of cations per cu cm. No Hall effect is expected 
from carriers pe and n; in localized 3d levels, and none has been observed. 


Under conditions of sufficiently high purity and stoichiometry, or at 


elevated temperature where an appreciable fraction of the current is 
carried by 2p holes, a Hall effect due to these holes is expected. Since 
such a Hall effect is due only to 2p holes, but the current is carried by 
three kinds of carriers, the Hall coefficient is given by 


Ry Pima ai (13) 
"(pi pa + ps wo + ns ms)?q 
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Fig. 19 — The Hall coefficient of NiO. (Samples B: and E, Ref. 36.) 
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Fig. 20 — Energy level diagram proposed for NiO. 


Because the 3d carriers give no Hall effect, their behavior has been 
studied using another tool, the Seebeck effect. In interpreting the See- 
beck effect of the 3d oxides it is assumed that 3d carriers transport no 
kinetic energy, since they spend most of their time trapped on lattice 
sites. Therefore, there is no phonon drag effect associated with them, 
and their Seebeck effect is due only to their separation in energy from 
the Fermi level. For simplicity, it is assumed that there is no phonon 
drag effect associated with 2p holes. This assumption is probably a good 
one, since most of the data have been taken at elevated temperatures 
where this effect is at a minimum. The 2p holes are assumed to carry a 
kinetic energy of 2k7. The over-all thermoelectric effect produced by 
three kinds of carriers is given by 


OTe = Q:To + Q2To> —_ Q;To3 (14) 


QTo = (Ep — Ey + 2kT)o, + (Ee — E2)o2 — (F3 — Ep)o3. (15) 


The problem is to determine three carrier concentrations and three 
mobilities from only three independent experiments. This complicated 
‘alculation will not be attempted here. Rather, two extreme situations 
will now be assumed: (a) NiO doped with so much Li that only 3d 
holes contribute to conduction, and (b) NiO so pure that 2p holes pre- 
dominate in conduction. 

In Fig. 16 it is shown that the conductivity changes in direct propor- 
tion to the amount of Li added, for high concentrations of Li. One infers 
from this that each Li added contributes one hole to conduction and that 


nearly all the impurity centers are ionized. Thus, conduction is entirely 
extrinsic and due to 3d holes. Under these simplified conditions, (9) 
and (14) reduce to 


(16) 
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and 
QT (EL, _ Fs). ( 7 ) 


Knowing now, from (17), the location of the Fermi level with respect to 
the 3d° levels, the concentration, ps , of 3d holes can be calculated from 
equation (11) if the density of states, V, available to the holes is known. 
This is assumed to be the number of cations per cu cm of crystal, which is 
5.6 & 10” in NiO, since we are dealing with holes localized on the cations 
rather than in an energy band. 

Now, with the measured ¢ and the hole concentration calculated in 
this way from QT for sample 6, the mobility, ue , of the 3d* holes has been 
calculated and is plotted in Fig. 21. Both in magnitude and in tempera- 
ture dependence this result is in accord with the ideas concerning trans- 
port in highly localized 3d orbitals discussed in Sections I and VIII. 
Mobility at 300°K is 0.004 cm?2/volt-sec and increases exponentially with 
temperature, with an activation energy of 0.10 ev. At low temperatures, 
QT (sample 6) is constant with temperature, at 0.10 ev. If the Fermi 
level lies halfway between the 3d° levels and the acceptor states, this 
would indicate then that the [Lit Ni***] acceptor levels are 0.20 ev 
above the 3d° levels in this sample, as compared with the value of 0.5 
ev calculated in Section V and shown in Fig. 15. This suggests that a some- 
what higher value should have been closen for the dielectric constant 
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Fig. 21 The mobility of holes in the 3d° levels of NiO obtained from measure 
ment of conductivity and Seebeck effect. (See Fig. 16 for composition of samples 
5 and 6.) 
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in these calculations. On the other hand, at the high density of defects 
existing in sample 6, interaction among defects may occur, and this 
would reduce their ionization energy. 

A calculation of pe. and we has been made for sample 5, in which the 
[Li*] is not in the range where c2 is directly proportional to [Lit] (see 
Fig. 16). The mobility results are shown asa dashed line in Fig. 21. At 
temperatures for which 1/7’ is >1.5 X 10-* they are parallel to, but 
much lower than, the results for sample 6; at higher temperatures they 
rise rapidly with temperature. It is assumed that transport of holes in 
the 3d° levels consists of Ni+* states drifting by jumps from one cation to 
a neighbor cation under the influence of the applied electric field. There- 
fore, it seems likely that mobility for this type of transport will be 
independent of impurity concentrations. In fact, this is suggested in 
Fig. 16 by the direct proportionality between conductivity and Li con- 
centration for high-Li concentration. Consequently, the results on sample 
5 probably mean that conduction in the 2p band is appreciable in this 
sample and cannot be neglected in any sample containing less than a 
3 X 10 atom fraction of Li (or approximately 2 X 10° Li atoms 
cm‘). This suggests, further, that the 2p band is not much farther below 
Ey than are the 3d° levels, in accord with the energy level diagram in 
Fig. 15. 

Now assume the second situation, pure NiO, with only 2p holes in- 
volved in conduction and yw; = Ryo. This result for sample EF is shown 
in Fig. 22. Also, from this assumption (15) reduces to 


QT = (Kye + 2kT). (18) 


From this, p: can be calculated from (10), assuming m*/m 1. Since 
o1 = @, wm can again be calculated. This has been done for samples | 
and B, and is also plotted in Fig. 22. There, the «4; results are compared 
with yz, , a lattice-scattering mobility considered reasonable for carriers 
in the wide band of an oxide. The high values found for 4; support the 
idea that 2p holes are involved in the conduction. The temperature 
dependence that is found is quite far from a reasonable one, showing that 
the other conduction processes also play a role in these samples and 
cannot be neglected. These results, then, do not represent true 2p hole 
mobility. They do indicate, however, that the energy level scheme pro- 
posed for NiO is a reasonable one. With this model in mind, the QT 
results of Fig. 18 can be understood in a qualitative way. At low tempera- 
ture, Ey is midway between FE, and EF, , the acceptor levels, so that 


o = 0; + o2. As the temperature is raised, Ly moves upward through 


FE’, as acceptors become completely ionized, and QT increases accordingly. 
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Fig. 22 — The mobility of holes in the 2p band of stoichiometric NiO obtained 
from measurement of conductivity Hall effect and Seebeck effect. 





Finally, at high temperatures, all the acceptors become ionized in the 
purest samples, Ey lies nearly midway between EF, and FE; , and QT be- 
comes constant with temperature. Under these conditions, QsTa. — 
();:To; approaches zero, and QT is p-type, due largely to 2p holes. At 
high temperature, measured Q7 for samples 1 and B, is about 1 
ev. Furthermore, the slope of the conductivity and Hall coefficient 
plots for these two samples is also about 1.0 ev at high temperature. 
These results suggest that the 2p band is about 1.0 ev below the Fermi 
level in the intrinsic situation and, therefore, that the energy levels of 
Fig. 15 should be scaled down by a factor of one-half. This is also in 
fair agreement with the finding that the [Lit Ni***] acceptor level lies 
0.2 ev above the 3d° level rather than 0.5 ev above, as was calculated. 
Evidently the value of ten would have been a better assumption for the 
dielectric constant of NiO. 


VII. IRON OXIDE, Fe,O; 


Magnetic and structural properties of the rhombohedral sesquioxide 
of iron have been described in Section II. Its semiconductor behavior is 
very similar to that of NiO, except that Fe.O; can exist as either n- or 
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p-type because the cation can take on a lower as well as a higher valence 
state. Oxidation causes pure Fe.O; to become p-type, probably due to 
the production of Fe*+*+** ions in the vicinity of cation vacancies. Re- 


duction causes pure Fe,O; to become n-type, probably due to the pro- 


duction of excess cations in the form of Fet* ions. With respect to the 
normal lattice of Fe+*+* ions, Fe**** is a hole and Fe** an electron. 
The room-temperature resistivity of Fe2O,; is approximately 10 ohm-cm. 
This can be lowered to about 4 ohm-cm by the addition of Ti. It is known 
from an X-ray study®* that, in the isomorphous compound FeTiQ; , 
the cations are Fet* and Ti**+** at room temperature. Presumably 
then, Ti++*+ in FeO, also ionizes, to give a 3d electron (Fe*++) and a 
Tit+*+++ ion. The change in resistivity with added Ti is shown in Fig. 23. 
At high Ti concentrations the resistivity does not decrease with added 
Ti. This may be due to a shifting oxygen equilibrium with Ti content, 
which introduces acceptors that compensate for some of the donors. 

Transport data! ** available for Fe,O; have been obtained on sintered 
bars, and consist of conductivity, Fig. 24, and thermoelectric effect, Fig. 
25. Samples 1 and A are identical in conductivity, but the thermo- 
electric effect shows that, due to the different conditions of sintering, 
sample 1 is p-type and sample A is n-type. The QT data also show that 
both of these samples tend to be p-type at high temperatures, which is 
in accord with (14) and the idea that the general energy level scheme 
which was developed in Section V for NiO is probably applicable to 
most of the simple oxides. However, in Fe,O; , QT is smaller in the in- 
trinsic range than it is in NiO. This suggests that the 3d levels in FeO; 
are higher above the 2p band than they are in NiO. 
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Fig. 23 — The resistivity of Fe,O; as a function of Ti content. (Ref. 18). 
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Fig. 24 — The conductivity of FesO; (samples 1, A and 2) and FeO; containing 
Ti (samples 3 and 4; see Fig. 23 for composition). 


The situation in FeO; , where both donors and acceptors are present, 
is somewhat more complicated than in NiO. However, the simple case 
in which FeO, is so heavily doped with Ti that conduction is only by 3d 
electrons can be dealt with. Such a situation is probably represented by 
sample 4, in which 

G = 63, (19) 


QT = (EL; = Ey), (20) 


and n; is given by (12), where N = 4.0 X 10” cm-*. From these equa- 
tions and the data for sample 4, u; has been determined. It is found to 
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Fig. 25 — The Seebeck effect of Fe:O; (samples 1, A and 2) and FesOQ; contain 
ing Ti (samples 3 and 4; see Fig. 23 for composition). 


be identical with the hole mobility for NiO sample 6, shown in Fig. 21. 
Thus the mobility of 3d electrons in Fe.Q; is in accord, both in magni- 
tude and temperature dependence, with the idea of transport in localized 
3d levels. At T = 300°K, us = 0.004 em?/volt-sec and increases ex- 
ponentially with temperature with an activation energy of 0.1 ev. 
VIII. TRANSPORT IN THE 3d LEVELS 

In Section V the processes were described by which conduction elec- 
trons or holes are created in the 3d levels. In NiO, for example, a 3d 
hole freed from a cation defect is presumed to consist of a Ni**+* with 
the cation neighbors for some distance around it normal Ni** ions. The 
hole can diffuse about the cation lattice by exchanging an electron with 
a Nit* neighbor until it moves next to an ionized defect site, receives an 
electron and becomes trapped, or until it becomes annihilated by re- 
combination with a free electron. A net drift of such holes occurs when 
an external electric field is applied to the crystal, and thus an electric 
current is obtained. 

In Sections VI and VII the mobility of 3d charge carriers was shown 
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to be very low and to increase exponentially with temperature. These 
results are consistent with a suggestion made 20 years ago by de Boer 
and Verwey’ that conduction in NiO and Fe.QO; occurs as outlined above 
and that a potential barrier exists which must be overcome for a charge 
carrier to jump from one cation to another. The potential barrier is 
assumed to arise because the charge carrier spends a time on a par- 
ticular cation which is long compared to the vibrational frequency of 
the lattice, and this allows the ions surrounding a charge carrier to be- 
come displaced by its electric field. Evidence that this occurs in Fe.O; 
containing Ti was given in Section IIT. This displacement creates a field 
which tends to prevent the charge carrier from leaving its lattice site, 
and so the charge carrier has trapped itself.‘°: 4‘ These ideas can be repre- 
sented by two energy levels, one for a hole, the other for a normal cation 
neighbor, being moved up and down in energy by phonons. Whenever 
the two levels become the same, charge transfer can occur; thus, an 
exponential temperature dependence would be found for transport. 
One might expect this charge transfer to depend upon the electron 
configuration of the cation. Using the diagram in Fig. 6 one can see 
that, if Hund’s rule* is followed, a hole in MnO and CuO will be in the 
dy levels but a hole in FeO, CoO and NiO will be in the de levels. Now, 
since the de orbitals are directed between anions and towards neighboring 
cations while dy orbitals are directed at anions, transport in the two 
levels may be quite different. For example, transport in the dy levels, 
through an anion as in superexchange, will involve spin reversal below 
Ty regardless of crystal orientation. Transport in the de levels will not 
involve spin reversal if it is in the (111) planes of parallel spins but will 


require spin reversal if it is perpendicular in the planes of parallel spin, 
and it so may produce an anisotropy in conductivity, at least within a 


magnetic domain. 

If spin reversal is involved in transport, one might expect the activa- 
tion energy for transport below 7'y to be the sum of the polarization 
energy plus the exchange energy, and to be only the polarization energy 
above 7'y . This would introduce an abrupt change in the temperature 
dependence of conductivity at 7'y . Heikes and Johnston” believe that 
they have seen this effect in sintered samples of the oxides MnO, CoO, 
NiO and CuO which contained a large concentration of Li. However, 
the results described in Section VI for NiO containing Li (sample 6) 
show no change in the slope of mobility (Fig. 20) at 7'y , but do show 


* That the d levels are filled with electrons in such a way as always to main- 
tain a maximum spin multiplicity. 
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that the slope of conductivity is changing in this temperature region, 
because the Fermi level (Fig. 18) is beginning to move upward. It is evi- 
dent that these questions can be answered only by transport studies on 
single crystals. 
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